Gene TM1040_1589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1589 
Symbol 
ID4078398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1700175 
End bp1701434 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content59% 
IMG OID638006902 
ProductTRAP dicarboxylate transporter- DctM subunit 
Protein accessionYP_613584 
Protein GI99081430 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1593] TRAP-type C4-dicarboxylate transport system, large permease component 
TIGRFAM ID[TIGR00786] TRAP transporter, DctM subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.658577 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCTT TGATCTTTGC TTTCAAACTC TTGATCGCGG TGCCGGTGGC CTTGGTTCTG 
GCGCTCACGG CCATCTGGTA CATCTGGGAG AGCGGCAACA CGGTCCTCTA TGACAGTTTT
GCGCAAAAGA TGTTCTCCGG GCTAGAGAGT TACGGCCTGC TGGCGATCCC GCTGTTCATG
CTGACGGGGG AGATGATGAA CGAAGGCGGC ATGACCCGCC GCTTGATCAA TGCGGCTCGG
GTCTTTGTCG GCGGGTTTCG CGGTGGGCTC GCCTATATCA ACCTGCTGAC CAATATGTTC
ATGGCGGCGA TCATCGGATC GGCCACAGCG CAGATCGCGG TGATGGCCCG TGCGATGACG
CCAGAGATGG AAAAAGAGGG GTATGACACG GGCTTTGCCG CTGCCACGAC AGCGGCGGGC
GGGTTGCTTG CGCCGGTCAT CCCGCCGTCG ATGATGTTCG TGATCTTCGG CGTACTGGCC
CAGGTTCCGA TTGGCGAGAT GTTTCTTGCA GGCCTCATTC CGGGGATGAT GCTTGCAGGG
GCCTTTGCGC TGGTGATCTT CGGGATCGGT GTCGCCACAG GCTTTCCAAA GGGCAGCTGG
TTCACCGCGC CAGAGGCGTT CTCGGCGCTT CTATATTGCT TGCCCGCAGC GCTGATCCCC
TGTGCGATCA TCGGCGGTGT GGTGTTTGGG ATTGCAACCC CGACCGAGAG CGCTGCAATC
GCCTCTCTGC TGGCCTTTGG GATGGGGTGG CTGGTCTATG GCGAACTGAA GCCTGCGGCC
CTCTTTGCGA TGTTCCGTCG CACTGCCATC AACGCCTCGA TGATCATCTT CATGATCGCC
TGTGCCAATG TCTTTGGCTG GGTCATCATC TACGAGGCCC TGCCACAAAA GCTCGCGGCG
CTGATCACCT CGATCACCAG TGACCCGTTT CTGTTTCTGC TGATCGTCAA TCTGATCCTG
CTGCTGATCG GGATGCTGGT GGATGGGATC GCCGCCGTGA TCCTGATCTC GCCCATCTTG
TTGCCGATCG CGGTAAACCA CTATGACATC AGCGCCCATC AATTTGGTGT CGTCATGTGC
CTGAACCTCG TGCTGGGCTT GCTGACACCG CCCGTTGGGG TCGGGCTTTA TATTGCGTCT
TCCATGAGTG GCGCCACGCC CGGCCAGATC CTGCGCTCCC TCTGGCCATT CCTTTTGGCG
GTAGCGCTAA TCCTTGTCCT ATTGAGCCGG TTTCCTGTGT TGTCGACCGC GTTTCTATAG
 
Protein sequence
MTPLIFAFKL LIAVPVALVL ALTAIWYIWE SGNTVLYDSF AQKMFSGLES YGLLAIPLFM 
LTGEMMNEGG MTRRLINAAR VFVGGFRGGL AYINLLTNMF MAAIIGSATA QIAVMARAMT
PEMEKEGYDT GFAAATTAAG GLLAPVIPPS MMFVIFGVLA QVPIGEMFLA GLIPGMMLAG
AFALVIFGIG VATGFPKGSW FTAPEAFSAL LYCLPAALIP CAIIGGVVFG IATPTESAAI
ASLLAFGMGW LVYGELKPAA LFAMFRRTAI NASMIIFMIA CANVFGWVII YEALPQKLAA
LITSITSDPF LFLLIVNLIL LLIGMLVDGI AAVILISPIL LPIAVNHYDI SAHQFGVVMC
LNLVLGLLTP PVGVGLYIAS SMSGATPGQI LRSLWPFLLA VALILVLLSR FPVLSTAFL