Gene TM1040_3230 details


Gene Information

Locus tag: TM1040_3230
Symbol:
ID: 4075372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 228329
End bp: 230506
Gene Length: 2178 bp
Protein Length: 725 aa
Translation table: 11
GC content: 59%
IMG OID: 638004739
Product: twin-arginine translocation pathway signal
Protein accession: YP_611466
Protein GI: 99078208
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
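
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following Python sketch is illustrative only: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon. Both assumptions are consistent with the listed values, but the record itself does not state them.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS whose final codon is a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Values copied from the record.
START_BP, END_BP = 228329, 230506
GENE_LENGTH_BP, PROTEIN_LENGTH_AA = 2178, 725

print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```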


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.857008
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCCA ATATTTCCCG TCGTGGGTTT CTGAAAACCG CAGCCGCCGC GGGCGCGGTG 
CTTTATGTTG GGGCCACAGC TGACGGCGCT GCCGCTGCAG GGTCGTCGGA TGCGATGCTG
AATCCTTTTG TGAAGATCGA CAGCGATGGC AACATCATTG CGATCATCAA GCATTTTGAA
AAAGGACAAG GACCCGCCAC CGGCCTCTCC ACGCTGATTG CCGAAGAACT TGGCGTGACC
ATGGAGCAGA TTGGCTACGA ATTCGCCCCC TCCGACCCGC AGGTTTACAA CAACCTTTTG
TTTGGTCCTT TTCAGGGTAC TGGCGGCTCA ACCGCCATGG CCAATTCCTG GATGCAGTAC
CGCACAGCTG GTGCGATTGC ACGTGAGATG CTGATCAAGG CGGCAGCGCA GGCTTGGGGC
TTGGACACCG CAGATTTGGA CATCCAAGAC GGCATGGTGA CTGGCGGTGG AAAGTCCGCG
CCACTTGGAG ACTTTGTCGC GGCAGCGGCA GAGATCGCTC CATCTGAAAC GCCACGTTTG
CGGGACCAAT CCGAGTGGAA GATCATCGGA AAAGAGAGCA CACGACGCCT GGATAGCGCG
ATCAAAGTCA ATGGTCAGGC GCAGTTTGGC ATGGATCTGC ACCTGCCGAA CCAAATGGTC
GTCATGATCA AACGCACGCC ACAGCGTGGC GGCGTGGTGG CCGGATTTGA TGACAGCGCA
GCCAAGGACG TTAAAGGGTT CATCATGGCA ATGCCGCTCC CGACCAAACA TGGTGTTGCG
GTTTACGCCG AAAACACCTG GACCGCGATG CAGGCGCGGG CAGCGGTCGA AGTCGACTGG
GATATGTCTG CTGCCGAAAC GCGTTCCTCG CAGGAAATCC GCGACGAAAT CATGGCCGCG
TTGGATGCCG CGCCAGTGTA CAATGTGAAC AAGGCCGATA CAAACGCCGT CGCTTTGGCG
GTCGATGAGG CAGCGCAAGT CTTGGAAAAG ACATTCTATT TCCCGCTTCT TGCCCATGCG
CCGATGGAGC CGATGAACTG CACCATTGAA CAAACAGCGG ATGGCGACAT CGTCCTGCAT
GATGGGGCGC AGATGCCAAC CGGCCCGCAT ATGGCCTATC AGCAGATCTT TGGTCTCCCC
GCCGAGAAAA TCCACATCAA CACGATGCTG GCGGGCGGCT CCTTCGGGCG ACGGGCAACG
CCGGATGCTG ATTACCAGGT GGAAGCGGGT CTGGCGTTTG CGATGACCGA TCGCTCTCGC
CCGGTGAAGC TGGTCTGGGA TCGGGAAGAC GACATTCGTG GTGGCTACTA CCGTCCGGCG
ACGGGGCATA AGGTGCGGAT CGGGCTCGAC GCAGACGGCA AGATTACTGG ATGGGAGCAT
CAGGTCGCAG GTCAGTCGAT CATGAAAGGC ACAGCCTTTG AAGCGATGGC CGTCAAGGAT
GGCATTGACC ACTCCACCGT CGAAGGTCTG GCCGACAACC CTTATGTGAT CCCAAACATG
GCCGTTGGTC TGACAGACAC CGAAAAGGCC ACCTCCGTGC TGTGGTGGCG CTCGGTTGGG
CACACGCACA CCGCGTATGT GATGGAGGTC ATGATGGATA TGGCCGCCAA GGCCGCTGGG
CGTGATCCGG TCGAGTTCCG CTTGGCCTAC CTTGAGGGCG GCAACAAAGA TGCGCAGCGC
AAAGCTGGCG TTCTGAAGCT CGCGGCCGAG AAAGGGAACT GGGGCAACCC TGCGGCAGGC
AATGTCCAAG GTATCGCGGT GCACAAGTCG TTTGGGTCGT TTGTCGCTGA GGTCGTCGAA
GTGTCGGGCA CACCCGACGA TGGCATCCAG ATCGAAAAGG TCACGGCTGC CGTCGATTGC
GGCATTGCAG TAAACCCGGA TGTCATCCGG GCCCAGACCG AAGGCGCCAT CGGGTATGGC
ATCGGTCATG CGATGCGGGA CCAGATCACA CTCGATGGTG GCGAGGTGGA GCAATACAAC
TTCCCGGACT ACGAGCCACT TCGGATCTCT GACATCAAGG CGATCGAGAC GCATATTGTG
GCGTCTGCTG AAGCACCGAC CGGCATTGGG GAACCGGGTA CTCCGCCCTC GGCCCCAGCG
TTGGCGAATG CGATCGCACA GTTGGATGTT CGCGTTGCAG AGCTCCCGAT GAGCGAAAAC
GGCGTGTCCT TCGCCTAA
 
Protein sequence
MTANISRRGF LKTAAAAGAV LYVGATADGA AAAGSSDAML NPFVKIDSDG NIIAIIKHFE 
KGQGPATGLS TLIAEELGVT MEQIGYEFAP SDPQVYNNLL FGPFQGTGGS TAMANSWMQY
RTAGAIAREM LIKAAAQAWG LDTADLDIQD GMVTGGGKSA PLGDFVAAAA EIAPSETPRL
RDQSEWKIIG KESTRRLDSA IKVNGQAQFG MDLHLPNQMV VMIKRTPQRG GVVAGFDDSA
AKDVKGFIMA MPLPTKHGVA VYAENTWTAM QARAAVEVDW DMSAAETRSS QEIRDEIMAA
LDAAPVYNVN KADTNAVALA VDEAAQVLEK TFYFPLLAHA PMEPMNCTIE QTADGDIVLH
DGAQMPTGPH MAYQQIFGLP AEKIHINTML AGGSFGRRAT PDADYQVEAG LAFAMTDRSR
PVKLVWDRED DIRGGYYRPA TGHKVRIGLD ADGKITGWEH QVAGQSIMKG TAFEAMAVKD
GIDHSTVEGL ADNPYVIPNM AVGLTDTEKA TSVLWWRSVG HTHTAYVMEV MMDMAAKAAG
RDPVEFRLAY LEGGNKDAQR KAGVLKLAAE KGNWGNPAAG NVQGIAVHKS FGSFVAEVVE
VSGTPDDGIQ IEKVTAAVDC GIAVNPDVIR AQTEGAIGYG IGHAMRDQIT LDGGEVEQYN
FPDYEPLRIS DIKAIETHIV ASAEAPTGIG EPGTPPSAPA LANAIAQLDV RVAELPMSEN
GVSFA
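
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The Python sketch below is a hedged illustration, not the database's own method: the helper names are hypothetical, and the codon table is written out by hand using the amino acid assignments of the standard genetic code, which translation table 11 shares. It handles only plain A, C, G, T input and does not model the alternative start codons that table 11 permits, which does not matter here because the gene starts with ATG. As an example it translates only the first line of the gene sequence.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(text)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(text)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence from the record.
first_line = "ATGACTGCCA ATATTTCCCG TCGTGGGTTT CTGAAAACCG CAGCCGCCGC GGGCGCGGTG"
print(translate(first_line))  # MTANISRRGFLKTAAAAGAV
```

The translated residues match the first 20 residues of the protein sequence in the record. Applying `gc_fraction` to the complete gene sequence would be the way to compare against the listed GC content.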