Gene TM1040_3036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3036 
Symbol 
ID4075741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp3409 
End bp4425 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content59% 
IMG OID638004537 
Productsulfonate ABC transporter, periplamic sulfonate-binding protein 
Protein accessionYP_611272 
Protein GI99078014 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTCA ACAGACGACA GACATTGGCG CTAATGGGCG CCGCCGCTGC ATCAGGCCTC 
GCCGCTCCGG CGCTTGCTTC TGGCAAAAAA CCGGTGGTGG GCGCGCTGAG CCTCACGAGC
CATTCTGGCA GCTTTATCGC GCTGGAGCGG GGATATTTCA AAGAGGCGGG CCTCGATGTC
GAGCTCAAGT TCTTTCAAGC CGCACAGCCC ATGGCCGTTG CCATCGCCTC CGGTGACGTA
GATTTCGGCG TTACCGCGAT TTCCGGCGGC CTCTTGAGCC TTGCAGACAA AGGCGCGGTC
AAGGTGATTG GCGGCGCGCT ATCCGAGGAA CCCGGCATCG ACGGGCAGAA GATCCTTGCC
TCTGATGCAG CCTATCAAGC GGGGCTCACG TCGGTCGCGG CTCTGGATGG CAAACGCTAC
GGGATGACCA CTGCGGGATC GTCCTTTCAC TACATGGGCT CCAAGATCGC TGGCGCTGAA
GGCGGGACGC CGCAGTTTGT GCCACTGCAA AAGGTTGGCG CGATTATTGG CGCGCTGAAA
TCGGGTCAGA TTGATGCCTG GTCCATCGTA CCCCATATCG CAAAGCCGCT CGCAGGCTCG
GGCGCGGTGC ATATCATCGG CAATGTCGCG GACTATCTGC CGAATTACCA GGTCACAACT
GTCTTTACCT CTGCGCAGAA CGCGAGCAAG GAACGCGGTC TGACAGAGAG CTTCCTCAAG
GGCTTTGGCA TGGGGGTGTC GGATTACAAC GCCACCATGG TCGACAAGCA AAACGGTGAG
GACGCCATCA ACGAGATGGT CGATCTGATC CACAAATATG TCTACACCGA CCGCCCGCGC
GAAAAAGCAG CGCCGTCGAT CATCAATGGG TCCATGCGTC TCAACAAAGA TGCTGCGATC
AATGTGGCCT CGGTGTCTGA TCAGCTGGCC TGGATGCAGT CGGAGGGCCT TGTCGATGCC
GGGATCACGC TCGAGACCTT CCTCGATACC AGCTACGTCG ATGTGATCGG CGCCTAA
 
Protein sequence
MTFNRRQTLA LMGAAAASGL AAPALASGKK PVVGALSLTS HSGSFIALER GYFKEAGLDV 
ELKFFQAAQP MAVAIASGDV DFGVTAISGG LLSLADKGAV KVIGGALSEE PGIDGQKILA
SDAAYQAGLT SVAALDGKRY GMTTAGSSFH YMGSKIAGAE GGTPQFVPLQ KVGAIIGALK
SGQIDAWSIV PHIAKPLAGS GAVHIIGNVA DYLPNYQVTT VFTSAQNASK ERGLTESFLK
GFGMGVSDYN ATMVDKQNGE DAINEMVDLI HKYVYTDRPR EKAAPSIING SMRLNKDAAI
NVASVSDQLA WMQSEGLVDA GITLETFLDT SYVDVIGA