Gene TM1040_3357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3357 
Symbol 
ID4075256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp368526 
End bp369584 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content59% 
IMG OID638004865 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_611591 
Protein GI99078333 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.422667 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.485488 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAAA CTTTGGTTAA GACAGCTGCG CTGTCGGTGC TTCTGGCAGG CACCGCCCTG 
ACAGCAAGTG CTGCCGATTA CACACTGCGT GCAACGGCAA ACTCGAACGA AAACGACGAA
GACTACGATG GCCTCGTGGT TTTCAAAAAC TACGTCGAAG CCGCATCCAA TGGCGCCATC
GAAGTGGAGC TGTTCATCGG TACGCAGCTG TGCTCGAACG GGGCGGAATG CCTTCAGGGC
GTCGCGGATG GTTCGATTGA CATCTATATC TCGACCTCGG GCGGTGCCTC CGGCCTGTTC
CCCTATGTGC AGGTTCTGGA CCTTCCGTAT CTGATGGCGG ACGACCGGAT TGCAGAGCAT
GTCCTGTCCG GTGATTTCAC CCGCACCATG CGGGACATGG CTCTGGAAGA TTCCGGCGAC
ACCATTCGTC TGATGACCAT CGGCAACACC GGCGGTTGGC GCAACTTTGC CAACACCAAA
CGCCGCATCG CAGAGCCTGC GGACATGGAA GGTTTGAAGA TTCGCACCGT GGTTGCGGAC
CTGCCGCAAG AACTGGTCAA AGCCCTGGGT GCATCCCCGA CCCCGATCCC GTGGCCGGAA
CTGTTCACCT CCTTTCAGAC CGGAGTTGTT GAAGGGTCGA AGAACGGTAT CACCGACATC
ATGGGCATGA AGTTCCCCGA TGCTGGTTTG CAGTATGTCA CCCTGGATGG CCACGCCTAC
ATGGGGGCCT TGTGGTGGAT GTCGAACCAA AGCTTCCAGG CGATGCCGGA AGACATGCGC
CGCGTGGTTG TGGACGGCTT CTACGCGCTG CAGCAGGCGA CCTTCGCGTC TCCGAAGCGT
AAATCCATCG CGGCTTACGA AGAATTCGTA GCAGGTGGTG GCGACCTCTA CGTACCGACC
CCGGACCAGA AAGCCGCCTT CAAAGAAGCC GCTTCCCCGG TCTACGACTG GTTCAAGTCC
AACGTGACCC GTGGTGACGA AATCTTCACC GCGCTGACCG ACGCCGTGGC AGCTGCCGAG
GCCGAGATCG ACGCGGATCG CGCTAAAGAC CTGAAATAA
 
Protein sequence
MLKTLVKTAA LSVLLAGTAL TASAADYTLR ATANSNENDE DYDGLVVFKN YVEAASNGAI 
EVELFIGTQL CSNGAECLQG VADGSIDIYI STSGGASGLF PYVQVLDLPY LMADDRIAEH
VLSGDFTRTM RDMALEDSGD TIRLMTIGNT GGWRNFANTK RRIAEPADME GLKIRTVVAD
LPQELVKALG ASPTPIPWPE LFTSFQTGVV EGSKNGITDI MGMKFPDAGL QYVTLDGHAY
MGALWWMSNQ SFQAMPEDMR RVVVDGFYAL QQATFASPKR KSIAAYEEFV AGGGDLYVPT
PDQKAAFKEA ASPVYDWFKS NVTRGDEIFT ALTDAVAAAE AEIDADRAKD LK