Gene Afer_1683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAfer_1683 
Symbol 
ID8323774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidimicrobium ferrooxidans DSM 10331 
KingdomBacteria 
Replicon accessionNC_013124 
Strand
Start bp1769193 
End bp1770446 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content68% 
IMG OID644952814 
ProductGTP cyclohydrolase II 
Protein accessionYP_003110272 
Protein GI256372448 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II
[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.10286 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAGA CAACAGCGGC GACGCCTTTC TCGTCGGTCG AGGATGCCAT CGAGGCGTTT 
CGCCGAGGCG AGATGGTCAT CGTCGTCGAC GACGAGGATC GTGAGAACGA GGGTGACCTC
ATCATGCCTG CCGACGCGGT GCGTCCGGAC GACGTCGCCT TCTACCTGCG CGTCACCTCG
GGCATGATCT GCGTGTCGGC CGAGTCGCAG CTCCTCGACG CCCTCGAGCT GCCGCTCATG
GTGGAGCACG GGACCGATCC CCGCAAGACC GCCTTCACCA TCACCGTGGA CGCTGCGCGC
GGCGGTACGA CCGGCATCTC GGCCGAGGAC CGAGCCCGCA CTGTGGCGCT GCTCGCGAGC
CCGGACGCGC GCCCTAGCGA CTTCGTGCGA CCTGGTCACG TGTTCCCGCT GCGCGCGCGT
GAGGGTGGGG TCCTCAAGCG AGGTGGACAC ACCGAGGCAG GGGTCGATCT TGCCCGCCTC
GCCGGTCGGC GGCCTGCCGC GATGCTTGCC GAGATCACGA CCGAGGATCG TCAGTCCATG
GCCCGGCTCC CCGAACTGGC GGCGTTCGCC GCTGCCTGGG GCGTCCCGCT CATCTCCATC
GCCGACCTGA TTCGCTACCG GGCGGCGCGC GAGACGCTCG TCGCACGCGT CGAGGGGTCG
GAGGCGACCA TTCCGACCCA ATGGGGAGCG TTCGATGCGG TGGTGTACCG GAGCGTCCTC
GAGCCCGGGG ACGAGCATCT CGTGCTCAGC CTGGGCGAGA TCGACGACAC GGAACCGGTG
CTCGTGCGGG TGCATTCGGA GTGCTTGACC GGGGATATCT TCGGGTCGTA TCGGTGTGAT
TGCGGTCCGC AGCTCCACGC CGCCCTCGCA CGGATCGCGG ACGAAGGACG AGGGGTGGTG
GTCTACCTGC GTGGGCACGA GGGCCGCGGC ATCGGGCTCG GTCACAAGCT GCGCGCGTAC
CACTTGCAGG AGTTCGAGGG GCTCGATACG GTCGAGGCGA ACGAGCGCCT CGGGCTGCCG
GTCGATGCGC GTGAGTACGG CATCGGTGCC CAGATCCTCG CCGACCTCGG TGTGCGACGC
ATGCGTCTGC TCACGAACAA TCCGGCCAAG TACCGTGGAC TCACTGGATT CGGCCTCGAG
ATCGTCGAGC GCGTGCCGAT CGTCACCGAG GTGCGACCCG AGAACGCGCG CTATCTCGAG
ACCAAGGCGC GCAAGTTGGG GCACCTGCTC GACGTCGCGA GGGGAGAACG ATGA
 
Protein sequence
MSETTAATPF SSVEDAIEAF RRGEMVIVVD DEDRENEGDL IMPADAVRPD DVAFYLRVTS 
GMICVSAESQ LLDALELPLM VEHGTDPRKT AFTITVDAAR GGTTGISAED RARTVALLAS
PDARPSDFVR PGHVFPLRAR EGGVLKRGGH TEAGVDLARL AGRRPAAMLA EITTEDRQSM
ARLPELAAFA AAWGVPLISI ADLIRYRAAR ETLVARVEGS EATIPTQWGA FDAVVYRSVL
EPGDEHLVLS LGEIDDTEPV LVRVHSECLT GDIFGSYRCD CGPQLHAALA RIADEGRGVV
VYLRGHEGRG IGLGHKLRAY HLQEFEGLDT VEANERLGLP VDAREYGIGA QILADLGVRR
MRLLTNNPAK YRGLTGFGLE IVERVPIVTE VRPENARYLE TKARKLGHLL DVARGER