Gene TM1040_1899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1899 
Symbol 
ID4077396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2000157 
End bp2001935 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content62% 
IMG OID638007215 
ProductTrkA-C 
Protein accessionYP_613894 
Protein GI99081740 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCTCT TTCAATCGGG TGACACCACA AGCGCCATTA TCGCGTTGTT GATCGTTCTT 
GCGATGTTCG TGGCCTTTTT GCGCGAAACC TATCCGACCG AGGTGGTGGC GATCTGTGGC
GTCTCCTTGA TGCTGATCAC CGGGGTGCTG CCCTATGCCG AGGCGCTGCC GGTGCTGGCG
AACCCGGCGC CCTGGACCAT TGCGGCGATG TTCCTCATCA TGGGGGCCCT GGTGCGCACC
GGGGCGCTTG ATGCCTTTAC CTCTGTGGCC CGCAAAAAGG CCGAAGTGAG CCCGAAAATG
GCAATCGCGC TCCTGATGGG CTTTGTGGTG ATCGCCTCGG CCTTTGTGTC CAACACGCCG
GTGGTGGTGG TGATGATCCC GGTCTTTATC CAGATCTCGC GCACGCTCAA TGTTTCGCCC
TCCAAGATGC TGATTCCGCT CTCTTATGCA GCGGTTCTGG GCGGCACGCT CACCTTGATC
GGGACCTCTA CCAACCTGCT GGTCGATGGG GTGGCCCGCG CGCAGGGTCT TGCGCCATTC
TCTATCTTCG AGGTCACGCC CTTGGGCATC GTGGTTGTGG TCTGGGGGCT GATTTATCTG
CGCTTTATCG CGCCGCGCCT GCTGCCGGAA CGGGACAGTA TGGCGATGAT GCTCTCGGAT
CGCTCGAAGA TGAAATTCTT TACCGAAGCG GTGATCCCGC CTGACAGCAA CCTCATCGGC
CGCGAAGTCA CGGGCGTGCA GCTCTTCAAA CGCCCGGGCG TGCGGCTGAT CGACGTGATC
CGCGGCGATG ACTCGCTGCG GCGCAACCTG CAGGGCGTGG AGCTTCAGGT GGGCGACCGG
GTTGTGCTGC GCACCCAGAT GACAGAGCTT CTGAGCCTGC AGCGCAACAA GGAGCTGAAG
CGCGTCGATC AGGTCTCGGC TGTGGAAACC AAGACGGTCG AGGTGCTCAT CACGCCTGGC
TGTCGCATGG TGGGGCGCAG CCTTGGGGCA ATGCGCCTGC GTCGGCGTTA TGGCGTCTAT
GTGCTGGCGG TACACCGGCG CAACCAGAAC ATCGGCGTGC AGCTTGATGA TCTGGTGGTG
CGCGTGGGGG ATACGCTGCT GCTGGAGGGC GCGCCGGGCG ATATCCAGCG CCTTGCGGCA
GAGACGGACA TGGCCGATGT GTCCCAGCCC ACGCAGCGTG CCTATCGCCG CAGCCACGCG
CCGGTTGCAG TGGCCGCCAT GGTCGGCATC GTGATCGCCG CCGCCTTTGG TCTGGCACCG
ATCCTGATGC TGTCGATCCT TGCGGTGGCG CTGGTGCTGG CGACCCGCTG TATCGACGCG
GATGAGGCGT TTTCCTTTGT GGATGGGCGG CTTCTGGCGC TGATCTTTTC CATGCTGGCC
ATTGGCGCGG CGCTTGAAAG CTCCGGCGCG GTCGAACTCA TTGTCAACGC GATCTCCCCG
GCCTTGGTGG ATCTGCCGCC CTTCTTTCTG GTCTGGGCGG TATATCTTCT GACGTCGGTG
CTCACCGAGC TTGTGTCCAA CAATGCCGTT GCCGTGGTGG TCACGCCGAT TGCGGTGGGG
CTGGCGCAGG CCATGGGGCT TGATCCGCGC CCGCTGGTGA TAGCGGTGAT GATTGCAGCC
TCGGCCTCAT TTGCCACGCC CATCGGCTAT CAGACCAATA TGCTGGTCTA TGGTCCGGGG
GGCTACAAGT TCTCGGACTT CCTCCGGGTG GGCATTCCGC TCAATCTCTC GATGGGGCTT
TTGGCCTCGG CCGTGATCCC GCTGCTCTGG CCACTCTGA
 
Protein sequence
MFLFQSGDTT SAIIALLIVL AMFVAFLRET YPTEVVAICG VSLMLITGVL PYAEALPVLA 
NPAPWTIAAM FLIMGALVRT GALDAFTSVA RKKAEVSPKM AIALLMGFVV IASAFVSNTP
VVVVMIPVFI QISRTLNVSP SKMLIPLSYA AVLGGTLTLI GTSTNLLVDG VARAQGLAPF
SIFEVTPLGI VVVVWGLIYL RFIAPRLLPE RDSMAMMLSD RSKMKFFTEA VIPPDSNLIG
REVTGVQLFK RPGVRLIDVI RGDDSLRRNL QGVELQVGDR VVLRTQMTEL LSLQRNKELK
RVDQVSAVET KTVEVLITPG CRMVGRSLGA MRLRRRYGVY VLAVHRRNQN IGVQLDDLVV
RVGDTLLLEG APGDIQRLAA ETDMADVSQP TQRAYRRSHA PVAVAAMVGI VIAAAFGLAP
ILMLSILAVA LVLATRCIDA DEAFSFVDGR LLALIFSMLA IGAALESSGA VELIVNAISP
ALVDLPPFFL VWAVYLLTSV LTELVSNNAV AVVVTPIAVG LAQAMGLDPR PLVIAVMIAA
SASFATPIGY QTNMLVYGPG GYKFSDFLRV GIPLNLSMGL LASAVIPLLW PL