Gene Xaut_1794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXaut_1794 
Symbol 
ID5422160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXanthobacter autotrophicus Py2 
KingdomBacteria 
Replicon accessionNC_009720 
Strand
Start bp2022636 
End bp2023577 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content69% 
IMG OID640881042 
Productformylmethanofuran--tetrahydromethanopterin formyltransferase 
Protein accessionYP_001416696 
Protein GI154245738 
COG category[C] Energy production and conversion 
COG ID[COG2037] Formylmethanofuran:tetrahydromethanopterin formyltransferase 
TIGRFAM ID[TIGR03119] formylmethanofuran--tetrahydromethanopterin N-formyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0427197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.884171 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCCC AAGCCCCCAT GATCATCAAC GGCGTCGAGA TCCGCGACAG CTTCGCCGAG 
GCGTTCCCCA TGGCGGGAAC GCGCCTCATC ATAACCGCCG ATACACCGCG CTGGGCGCAT
ACCGCCGCAG CGAGCCTCAC CGGCTTTGCC ACCTCGGTCA TCGGCTGCGG CTGCGAGGCG
GCCATCGAGC GGCAGCTTGC CCCCGACGAG ACCCCGGACG GCCGGCCGGG CTATGCCGTG
CTCATCTTCG CCATGTCCCT GAAGGATCTG AAGAAGGTGG TGCCGCTGCG GGCCGGGCAA
TGCGTGCTCA CCTCGCCCAC CTCCGCCTGT TATTCGGGGC TGGAGGGCGG GGCCGCCATC
GCGCTCGGGC GGGCGCTGCG CTATTTCGGC GACGGCTACC AGATCGCCAA GTCCATCGAC
GGGCGCCGCT TCTGGCGCAT CCCGGTGATG GAGGGCGAGT TTGTCTGCGA CGAGGTGGTG
GGCTCCACCA CCGCGGCCGT GGGCGGCGGC AACTTCCTCA TCCTCGCCCG CTCGCGCCCC
GCCGCCCTCG CTGCCGCGGA AGCGGCGGTG GAGGCCATGG GGCAGGTGCG CGGCGCCATC
ATGCCGTTTC CCGGCGGCGT GGTGCGCTCC GGCTCCAAGG TGGGCGCCAA ATATGCGGGC
ATGATCGCCT CCACCAACGA CGCCTATTGC CCGACCCTGC GCGGCGTCTC TCAGAGCGCC
CTGCCGCCGG AGGTGGAAAG CGTGCTCGAA ATCGTCATCG ACGGGTTGAG CGAGCAAGAC
GTGGCCGCCA GCATGCAGGC GGGCATAACC GCCGTCTGCG GCCTTGGCGC CGCCGCCGGG
GTGGTGGCGG TGGATGCCGG CAATTATGGC GGCAATCTCG GCCCCTTCCA TTTCAAGCTG
CGCCAGTTGA TGGCTCCTGT CGCCGGGGAG ACAGTAGTAT GA
 
Protein sequence
MTSQAPMIIN GVEIRDSFAE AFPMAGTRLI ITADTPRWAH TAAASLTGFA TSVIGCGCEA 
AIERQLAPDE TPDGRPGYAV LIFAMSLKDL KKVVPLRAGQ CVLTSPTSAC YSGLEGGAAI
ALGRALRYFG DGYQIAKSID GRRFWRIPVM EGEFVCDEVV GSTTAAVGGG NFLILARSRP
AALAAAEAAV EAMGQVRGAI MPFPGGVVRS GSKVGAKYAG MIASTNDAYC PTLRGVSQSA
LPPEVESVLE IVIDGLSEQD VAASMQAGIT AVCGLGAAAG VVAVDAGNYG GNLGPFHFKL
RQLMAPVAGE TVV