Gene TM1040_3641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3641 
Symbol 
ID4075069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp698127 
End bp699212 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content60% 
IMG OID638005161 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_611870 
Protein GI99078612 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.312252 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.742718 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAGAC GCGCGTTTCT CAAGACTGGT GCCATGGGCG CAGCCGCCAC AGCCTTGGCG 
AGCCCAGCGA TTGCCCAAGG AAAGATGCAA TGGAAGCTGG TCACAGCCTG GCCCAAGAAC
CTGCCGGGGC CGGGTGTCGC CGCACAGATG CTCGCAAACC GCATCACAAC GCTGTCGGGT
GGACGTATTG AGGTCAAACT CTTTGCTGCA GGCGAGCTTG TGCCGGGACG CGGCGTCTTT
GATGCGGTCT CCGAAGGCAC GGCAGAGCTC TATCATGCGG TTCCGGCCTA CTGGGGGTCC
AAATCCAAGG GCATTTTGCT TTTTGGCTCG CAGCCCTTTG GCCTGCGCGC AGACGAGCAG
TTTGGCTGGC TCTACCATGG TGGCGGTCAG GCGCTCTATG ACGAGATGTA TGGCCGCTTT
GGCATCAAGC CTTTCCTCTG CGGTAACTCC GGCCCGCAAT GGGGCGGCTG GTTCAAAACC
GAGATCAATT CCGCCGAAGA CCTGAAGGGG TTGAAGTTCC GCACCACGGG CCTTGCATCC
GAAATGGCGT CGAAACTTGG CATGGCAGCC GAAGCCACAA GCGGACCGGC CATGTTCCAA
GGCCTGCAAA CGGGTGCATT GGACGCAGGC GAGTTCATCG GCCCCTGGAC CGACAGCGCG
CTTGGCTATT ACCAGGTCGC CAAGAACTAC TACTGGCCCG GCGTGGGTGA ACCCTCTTCT
GCCGAGGAAT GCGGGGTAAA CGCTGACGTC TTTGCCGAAC TGCCGGATGA TCTCAAACAG
GTTGTCTCGC TGGCCTGCGA AAGCCTCTAC AATCCGGTCT GGACGGAATA CACCACCAAG
CACGCACTGG CGCTGAAGAA AATGGTCGAA GAGGACGGCG TTCAGGTCAA GATGTTCCCG
TCCGACGTGA TCGAGGCCAT GGGCACCGCA GCAGCAGAAG TCATCTCTGA GCTGCGCGAA
GACGAAGACG AACTTGTGCG CCGTATCACC GAGAGCTTTA TCAGCTATCG CGACAGTGTG
GGCGGCTACA TGACCTATGC CGACAACGGC CAGATGAACG CCCGCGCCTC GGTTATGGGC
TACTGA
 
Protein sequence
MQRRAFLKTG AMGAAATALA SPAIAQGKMQ WKLVTAWPKN LPGPGVAAQM LANRITTLSG 
GRIEVKLFAA GELVPGRGVF DAVSEGTAEL YHAVPAYWGS KSKGILLFGS QPFGLRADEQ
FGWLYHGGGQ ALYDEMYGRF GIKPFLCGNS GPQWGGWFKT EINSAEDLKG LKFRTTGLAS
EMASKLGMAA EATSGPAMFQ GLQTGALDAG EFIGPWTDSA LGYYQVAKNY YWPGVGEPSS
AEECGVNADV FAELPDDLKQ VVSLACESLY NPVWTEYTTK HALALKKMVE EDGVQVKMFP
SDVIEAMGTA AAEVISELRE DEDELVRRIT ESFISYRDSV GGYMTYADNG QMNARASVMG
Y