Gene Slin_3333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3333 
Symbol 
ID8727086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4030974 
End bp4032155 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content55% 
IMG OID 
ProductCystathionine gamma-synthase 
Protein accessionYP_003388142 
Protein GI284038212 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTTT CGAACTACGA AAAAAGTACA GCCTCTGTCT GGGCGGGCGA AACCGACGTG 
TTACCCAACG GTGCAGTCAC CACGCCCATC GTAAAAAGTG TTGCCTTCGC TTACACCGAT
CTGGACGAAT GGCACGAAGT AGCCCTGGGT AAAGCCGAGG GTTTTATCTA CAGCCGAAAC
ACCAACCCAA CCGTACACGT ACTCGAAGAA AAAATCCGGA TTCTGGAAGG TGCCGAAGCC
GCCACGGCCT TTGCCACCGG TATGGGTGCC ATCAGCAACA CCCTGTTTGC GCTACTGGGG
CCCGGCAAAC GGGTGGTTTC TCTGAAAGAT ACATACGGCG GAACCAGCCG TTTATTTCTG
GATTTCCTAC CCCGCTACCA GGTAAACGTA ACCCTTTGCG ACACTACCGA CTTCGACCAG
ATTGAAGCCG AAGTAGCCAA AGGCTGCGAC GTTCTGTATC TCGAAACGCC TACTAACCCT
ACTCTCAAAG TGGTGGACCT CGCCCGACTA GCCGCAGCCG CCAAAAAAGT GGGAGCCGTT
ACGGTGGTCG ATAATACCTT CGCGACACCC ATCAACCAGA ATCCGCTGGC CCTGGGTGCC
GACCTCGTGC TGCACAGTGC GACCAAGTTC CTGGGTGGTC ACTCCGATGC TATGGGGGGC
GTGCTATGCG GCAGTAAAGA ACTGGTCAGC AAGGTGTTCC AGTTTCGCGA AATAAACGGA
GCCAGCATTC AGGCCGATGC CGCTTATATG ATTGCCCGAG GGATGAAAAC GCTCGAACTG
CGTATCGAAC GGCAAAACGC GTCGGCCCTA ACCATTGCGC GGTATCTGAA AGCCCATCCT
AAAGTCAGCG ACGTCTTTTA TCCGGGACTT GAAACGCACC CCGGCCATGA GATTGCCAAA
TCGCAGATGT CGGGGTTTGG GGGTATTATG AGTTTCTCGC TGAACGGCGG CTATGAACAA
GTCAAAACGT TTTTACCGAA GCTCCGGTTT GTGCATCTGG CCGCCAGCCT GGGTTCGGTG
AGCACGCTGG CCGGACCACC CCGAACCACC AGCCACGTCG AACTAACCGA GGACCAGCGC
AGGCAGTTGG GCATTCCAGA AAGCCTGATC CGGTACTCAG TCGGCATTGA GAATGTAAAT
GATTTGCTAG CCGATCTGGA ACAGGCATTG GCTGCTTTGT AA
 
Protein sequence
MDFSNYEKST ASVWAGETDV LPNGAVTTPI VKSVAFAYTD LDEWHEVALG KAEGFIYSRN 
TNPTVHVLEE KIRILEGAEA ATAFATGMGA ISNTLFALLG PGKRVVSLKD TYGGTSRLFL
DFLPRYQVNV TLCDTTDFDQ IEAEVAKGCD VLYLETPTNP TLKVVDLARL AAAAKKVGAV
TVVDNTFATP INQNPLALGA DLVLHSATKF LGGHSDAMGG VLCGSKELVS KVFQFREING
ASIQADAAYM IARGMKTLEL RIERQNASAL TIARYLKAHP KVSDVFYPGL ETHPGHEIAK
SQMSGFGGIM SFSLNGGYEQ VKTFLPKLRF VHLAASLGSV STLAGPPRTT SHVELTEDQR
RQLGIPESLI RYSVGIENVN DLLADLEQAL AAL