Gene TM1040_3213 details


Gene Information

Locus tag: TM1040_3213
Symbol:
ID: 4075317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 208049
End bp: 209620
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 60%
IMG OID: 638004722
Product: hypothetical protein
Protein accession: YP_611449
Protein GI: 99078191
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
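
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record (Start bp, End bp, Gene Length, Protein Length).
START_BP = 208049
END_BP = 209620
GENE_LENGTH_BP = 1572
PROTEIN_LENGTH_AA = 523


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming an unspliced CDS with one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both comparisons agree with the values listed in the record.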


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.628959
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAACAG TCGGACACGC CTTGGCACAA ACCCTTGTTG AGCAGGGCAC CGAGATCATC 
TTTGGCATCC CCGGCGTGCA CACCATCGAG CTCTATCGTG GCATCGAAGG CGCAGGTATC
CGTCATATCA CACCGCGTCA TGAACAGGGT GCGGGATTTA TGGCAGACGG ATATGCCCGT
ATCGCTGGCA AGCCCGGCGT TGTCTTCGTG ATCACCGGAC CGGGTTTGAT CAACACGTTG
ACCCCGATGG CGCAGGCCCG CGCGGACAGC ATTCCAATGA TCGTGGTCAC CGGCGTCAAC
CGGCGCGACA GTCTTGGCAA GGGTCTCGGC CTCTTGCATG AGTTGCCAGA TCAACTGGGT
CTGTCGCAAA CGATATCCAA ACATGCCGAG CAAGTCGAAG ACGCATCGGC GCTCGAAGGG
GTGATGGCCC GCGTGTTTGG CGCTCTTCAA GGGCGCCCTG CCCCGGTCCA CGTCGAAGTG
CCGACGGACG TCATGACCCT ACCGGCGTCC GAAACGGTGA CGATCCCGGA TCCAGAGCCA
AAGGCTGCAA GCGATCTGAC TCCGATCCTG GATGCCCTGG CCCGATGCGA GAGTCCTGTG
ATCCTCGCGG GGGGCGGCTG CCGCACGCAG AACCTTGCGC TCTTGAAACT GGCACAAAGG
CTCGATGCGC CCGTCGTCCA GACCGTGAAT GCGCGTGGCC TGATGCATGC ACATCCCCTG
ACGGTTCCAG CCAGTCCGTC CCTTCAATCG GTGCGAGATC TCATTGCGGA AGCGGATTGT
GTTCTGGCTC TTGGAACCGA AATGGGGCCA ACCGACTACG ACATGTATGC GACCGGGACC
TATCCGGAGA TGAGCAACCT TCTGCGGATC GACATTTGTG ACGATCAACT CTCGCGCCAT
GAGGCTGCCT GCAGGCTCAC CGGGGATCTG AACGAAATTC TACCCGTTCT GGCCGAACAA
TCCCCGGGAA AATCAAATGC GCGTGGATCT GAGCGTGCAG AAAAGGCGCG CCTTGCGGCG
CGCGCCGAAA TCGAAGCGCT GACACCCGGG TATGCGCGTT TCGTATCGCA GATCGAGACG
CTGAGGGATG CTTGTCCAGA CGCTATTTTT GTCGGGGATT CCACGCAAGC GGTCTATGCG
GCCAACCTCT ATTATGACCA CAACCGCCCC GGAGGCTGGT TCAACGGCGC GACTGGTTTT
GGGGCCCTCG GCTACGCAAT CCCCGCCGCG ATTGGCGCCG CTCTTGCCGA TCCATCGGCA
CCGGTGGTCG CGCTGATGGG CGACGGCGGT GCGCAGTTCA CGCTGCCAGA GCTTGGCGTT
GCCCGCGATG AAAACCTGCC CATCCTGTTT GTCGTCTGGA ACAACAATGC GTTCCTGGAA
ATCTCGAACG CAATGGAGGC CGCAGGAATC TCTCCGACAG GCTGCCACCC TTCTGCACCT
GATTTTGAGG CAGCTGCCGC CGCATACCGG CTCGATTTTC GGCGGATCGC ACCAGAGCAA
CTTCAGAACG CGTTGTCCGA GATCCTCCCA CTCAATGGTC CGATGCTGCT CGAGATCGAT
ATGACGGGCT AG
 
Protein sequence
MRTVGHALAQ TLVEQGTEII FGIPGVHTIE LYRGIEGAGI RHITPRHEQG AGFMADGYAR 
IAGKPGVVFV ITGPGLINTL TPMAQARADS IPMIVVTGVN RRDSLGKGLG LLHELPDQLG
LSQTISKHAE QVEDASALEG VMARVFGALQ GRPAPVHVEV PTDVMTLPAS ETVTIPDPEP
KAASDLTPIL DALARCESPV ILAGGGCRTQ NLALLKLAQR LDAPVVQTVN ARGLMHAHPL
TVPASPSLQS VRDLIAEADC VLALGTEMGP TDYDMYATGT YPEMSNLLRI DICDDQLSRH
EAACRLTGDL NEILPVLAEQ SPGKSNARGS ERAEKARLAA RAEIEALTPG YARFVSQIET
LRDACPDAIF VGDSTQAVYA ANLYYDHNRP GGWFNGATGF GALGYAIPAA IGAALADPSA
PVVALMGDGG AQFTLPELGV ARDENLPILF VVWNNNAFLE ISNAMEAAGI SPTGCHPSAP
DFEAAAAAYR LDFRRIAPEQ LQNALSEILP LNGPMLLEID MTG
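
The sequences above are printed in blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup, plus a GC fraction calculation and a plain codon-by-codon translation. It uses the standard amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names are hypothetical, and the example input is only the first line of the gene sequence.

```python
from itertools import product

# Standard codon table, bases ordered T, C, A, G at each position.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record.
    first_line = (
        "ATGCGAACAG TCGGACACGC CTTGGCACAA ACCCTTGTTG AGCAGGGCAC CGAGATCATC"
    )
    dna = clean_sequence(first_line)
    print(translate(dna))  # MRTVGHALAQTLVEQGTEII
```

The translated fragment matches the first twenty residues of the protein sequence listed in the record. Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` gives a value to compare with the reported GC content.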