Gene TM1040_1895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1895 
Symbol 
ID4077392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1995323 
End bp1997098 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content64% 
IMG OID638007211 
ProductTrkA-C 
Protein accessionYP_613890 
Protein GI99081736 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.716075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.940085 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTGG ATCAAACCCT GCTTTTTGGC CTCTTTGCGA CCGTGTTCGG CCTGCTGCTC 
TGGGGCAGGT TTCGCTATGA TCTGGTGGCC TTTGGCGCAC TGATGGTGGC GGTGGTCCTG
GGGCTGGTGA AGGCCGAAGA TGCCTTTGCG GGCTTTGGTC ATCCGGCCAC CCTGGTGGTG
GCGCTGGTGC TTGTGGTGTC GGCCGGGCTG GTGAAATCCG GGGCCGTGCA TCTTATCACC
CGGACACTGG TGGACCAGAG CCGGGCGCTT GGCGCGCATA TTGCGCTGAT GGGCAGCGTG
GGCGCGGTCT TGTCGGCCTT TATGAACAAT GTGGCCGCTC TTGCGCTCTT GATGCCAGTG
GACATCCAGA CGGCACGCAA GGCAGGGCGC ACGCCAGGTC AGAGCCTCAT GCCCTTGTCC
TTTGCCACCA TTCTTGGCGG CATGGCGACG CTGATCGGCA CCCCGCCCAA TATCATCATC
GCCAGCATCC GCCATGACAC GTTGGGCGAA CCGTTCAAGA TGTTCGATTT TGCCCCCGTG
GGGTTGGCGG CGGCAGCGGC GGGGCTGGCC TTTGTGGCGC TGGTGGGCTG GCGGCTGATC
CCGGATCGTT CGGGCGCGGC GGGCGCCACC GAGAGCCAGC TTGCCCCCTA TATCGCAGAG
CTGACAGTGC CAGAAGACAG CGATCTGATT GGCCAGCGGC TTGGCAGTCT GGATCCGGAA
GCCGAGAAGG CCGATGTGGC CATCCTGGGG CTGATCCGCG GCGGGCAACG TCGCTATGGC
CGGGCAGCGG GGGCGCTACT GCAGGCGGGC GATACCATCG TGCTGGAGGC CACCCCGGAT
GCGCTGGATG AATTCCGCGC CACGCTCAAA CTCGATTTTT CCGATGCCGC GCGCGAAGAA
AAGCTGAAGG CTGCGGGCGA AGGGCTCGAG CTGATCGAGG TGGTGGTGCC AGAATATTCC
CGTATTGCGG GCCGCAGCGC GCAGGGCGTC GGACTCGCCT GGCGCCAGAG CGCGGTACTC
CTTGGCATTG CGCGTCAGGG CGAGCGGCTG ACCAAACATT TGCGCCAGAC CGAAGTGGCG
CCCGGAGACA TCCTGCTGAT CCTCTGCCCG CGCGATCGCG GGGCAGAGGT TGCAGAGTGG
TTGGGCTGTC TGCCACTCGC GGCACGCGGT CTTTCGGTGA CGGCCAATGA CAAGACCTGG
TGGGCAATCG GGCTCTTTGC CGCGGCGGTT CTGGCGGCGT CGGTGGGGCT GGTCTATCTG
CCGGTGGCGC TGGGACTGGT TGCGATTGGC TATGTCTTGC TAAAGATCCT GCCTGTGGCA
GAGATCTATG ACCATGTGGA ATGGCCTGTG GTGGTGCTCT TGGGGTCGAT GATCCCGCTG
GGACAAGCAC TCGAAACGTC AGGCGGCACC GAGCTTCTGG CCCATGGGCT TGTTGAGCTG
ACCACAGGGC TGCCCGCCTG GGCGATCCTG ACAGTGCTGA TGGTGGTGAC AATGACGCTG
TCGGATGTGC TCAACAACAC GGCGACCGCG ATTGTGGCGG CGCCGGTGGG GATCTCCATG
GCTCAGGCGC TGAATGTGTC GCCCGACCCC TTCCTGATGG CGGTTGCGGT GGCGGCCTCG
GCGGCTTTCC TCACCCCCAT CGGCCATAAG AACAACACGT TGGTGCTTGG CCCCGGCGGC
TATAAGTTCG GCGATTACTG GCGGATGGGG CTGCCGCTCG AGATCCTTGT GATCGCGGTC
TCGATCCCAG CGATCCTGTT CTTCTGGCCC CTCTGA
 
Protein sequence
MTLDQTLLFG LFATVFGLLL WGRFRYDLVA FGALMVAVVL GLVKAEDAFA GFGHPATLVV 
ALVLVVSAGL VKSGAVHLIT RTLVDQSRAL GAHIALMGSV GAVLSAFMNN VAALALLMPV
DIQTARKAGR TPGQSLMPLS FATILGGMAT LIGTPPNIII ASIRHDTLGE PFKMFDFAPV
GLAAAAAGLA FVALVGWRLI PDRSGAAGAT ESQLAPYIAE LTVPEDSDLI GQRLGSLDPE
AEKADVAILG LIRGGQRRYG RAAGALLQAG DTIVLEATPD ALDEFRATLK LDFSDAAREE
KLKAAGEGLE LIEVVVPEYS RIAGRSAQGV GLAWRQSAVL LGIARQGERL TKHLRQTEVA
PGDILLILCP RDRGAEVAEW LGCLPLAARG LSVTANDKTW WAIGLFAAAV LAASVGLVYL
PVALGLVAIG YVLLKILPVA EIYDHVEWPV VVLLGSMIPL GQALETSGGT ELLAHGLVEL
TTGLPAWAIL TVLMVVTMTL SDVLNNTATA IVAAPVGISM AQALNVSPDP FLMAVAVAAS
AAFLTPIGHK NNTLVLGPGG YKFGDYWRMG LPLEILVIAV SIPAILFFWP L