Gene TM1040_3319 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3319 
Symbol 
ID4075724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp327591 
End bp328571 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content59% 
IMG OID638004827 
ProductTRAP dicarboxylate transporter, DctP subunit 
Protein accessionYP_611553 
Protein GI99078295 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0205242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.450742 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTTT TTTCCCGCAG TCTCCTGGCC GCAGGTGCCA GCTTTGCCTT TGCCGTTCCG 
CTGGCCGCAC AGACCGAGAT CAAGATTGGC TATGCACTGG CAGAGGACAG CCATTACGGC
GTGGCTGCCA AGACATTCGA AGAGGTTGTG CTGGAGCAGA CCGGTGAGGA TTTCAGCTTC
ACGCATTTCC CGTCGTCCGG TCTGGGCGGT GAGCGCGATG TGATCGAGGG CCTGCAGCTT
GGCACCGTGG AAGTCACCAT CGTGTCTTCC GGCACGCTGG CCAACTTTGT CCCTGAAACT
GGTGTTTTCG ACATCCCGTT CCTGTTCCGG GATCTTGGCC ACGCCCGCTC GGTGCTCGAC
GGCCCCATCG GTCAGGACAT CCTTGAAAAG TTTGACGCTG TTGGCCTGCA TGCGCTGGCA
TGGGGCGAGC AGGGCTTTCG CCATATCACC AACAACCGTA ATGCAATCAA CACTCCTGCC
GACGTTCAGG GGCTGAAGCT GCGCACAATG GAAAACCCGG TCCACCTCGC GGCGTTCAAC
GCGATGGGCG CCGCGCCGAC ACCGATGGCG TGGCCCGAGG TGATCTCTTC CATGCAGCAA
GGGGTGATCG ACGGACAGGA AAACCCGCTC TCGGTGATCG TTTCGGTGAA ACTGGACGAA
GTGCAGAAAT ACCTGACCCT CTCCGGTCAC GTTTATTCGC CTGCGATGCT CTTGGTGTCC
AAACCCTTCT GGGAAGGTCT GAATGACGAG CAAAAGGCTG CGTTTGAAGC CGCCGCCGCC
GAGGCCGTGG GTGCCATGCG CGGATACGTC GATGGCATCG AAGCCAGCGG TGTTGAAACG
CTCAAGGAAC GCGGCATGGA AGTGAACGCG CTGAGCGCCG ATGAAAAAGC CGCGTTCCAA
GCGTCAATCC AGTCTGCCTA CGAGGGCTAT TACAAGACCT ATGGCGAGGA TCTCGTGAAA
TCGATCGTCG CGGCTGAGTG A
 
Protein sequence
MTVFSRSLLA AGASFAFAVP LAAQTEIKIG YALAEDSHYG VAAKTFEEVV LEQTGEDFSF 
THFPSSGLGG ERDVIEGLQL GTVEVTIVSS GTLANFVPET GVFDIPFLFR DLGHARSVLD
GPIGQDILEK FDAVGLHALA WGEQGFRHIT NNRNAINTPA DVQGLKLRTM ENPVHLAAFN
AMGAAPTPMA WPEVISSMQQ GVIDGQENPL SVIVSVKLDE VQKYLTLSGH VYSPAMLLVS
KPFWEGLNDE QKAAFEAAAA EAVGAMRGYV DGIEASGVET LKERGMEVNA LSADEKAAFQ
ASIQSAYEGY YKTYGEDLVK SIVAAE