Gene SYO3AOP1_1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSYO3AOP1_1066 
Symbol 
ID6331056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfurihydrogenibium sp. YO3AOP1 
KingdomBacteria 
Replicon accessionNC_010730 
Strand
Start bp1107360 
End bp1108898 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content32% 
IMG OID642657354 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001931239 
Protein GI188996988 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAAGAG CTTTGATATC TGTTTCTGAT AAAACAGGTG TATTAGAATT TGCAAAAGAG 
CTTAAAAATC TTGGATATGA GATTATATCC TCTTCAGGAA CTGCAAAATA TTTAAAAGAA
AATGGCATAG ATGTTATAGA AGTTTCACAA ATAACAGGAT TTCCAGAAAT ACTTGATGGT
AGAGTAAAAA CACTACACCC AAAAATTCAT GGCGGAATTT TAGCCATCAG AGACAACCAA
GACCATATGA AACAGCTTCA AGAAAATGAT ATTAAACCAA TAGATATTGT AGCTATAAAT
TTATATCCAT TTGAAAACAC AGTTAAAAAA GGTGCTGATT TAGATGAGAT TATAGAGAAT
ATAGACATTG GCGGTCCTGC AATGGTAAGA GCATCTGCTA AAAATTATAA ATACGTTGCC
ATAATAACAG ACCCAAAAGA TTATAGCGAT ATAATCAACG AGCTTAAAGA ATACGGAGAA
ATAAGCATAA ATACAAAGAA AAAACTTTCA TTAAAAGCAT TTAGACATAC AGCCTTTTAT
GACAGTATAA TTTCACAAGT TTTAAATGAA AAATTTGAAA TAAACGAAGA TTTTCCAGAA
AGTTTAACCA TTCCAATGAG ACTAAAATCC GGGCTAAGAT ATGGAGAAAA TCCACATCAA
AAAGCATCTC TTTACATAAA CCCACTTGAA AATGGCATTT CTGTTGCAGA TAGTGAGATA
TTACAAGGTA AAGAAATGTC TTTTAACAAC TACTACGACG TTGATTCTGC TGTGCTTTTG
GTTAAAGAGT TTGAAGAACC AGCCTGCGTT ATAGTGAAGC ATAACAACCC TTGCGGTGTT
GCAGTTGCAG AAAATATAAA ACAGGCTTAC ACCTTTGCCC TTGAAACAGA CCCAAAATCA
GCATTCGGCG GAATTGTAGC ATTTAATAAA GAAGTTGATG AAGATACAGC AAAAGAACTT
ACAAAATTAT TTTTAGAGGT TGTAGTTGCA CCTTCATTTT CAGATTCAGC GTTGGAAGTA
TTAAAAACTA AGAAGAATTT AAGAGTTGTA AAAGTTAAAA ACTTTGATAA AAAATTAGAA
GGAAAAGATA TAAAAAGAAT CTCGGGCGGT TATCTACTCC AAGACAGAAA TTTAGGTCTC
TATACAGAGT TAAAAGTAGT TACAGATAGA CAGCCAACAG AAAAAGAGTT GGAAGATTTA
ATATTTGCTT TAAAAGTTGT TAAGCATGTA AAATCTAATG CGGTTGTGAT AGCAAAAGAC
AAAAGAACTG TTGGCATTGG AGTAGGACAA ACTTCAAGAG TAGATAGTTT AGAAACTGCA
ATCAAAAAAG CAAAAGAATT TAATCTACCA TTAGAAGGAA GCGTTCTTGC ATCAGAAGCA
TTTTTCCCAT TTAGAGATAG CATTGATACA GCAGCAAAAG AAGGAATAAA AGCTGTTATA
CAACCAGGTG GTTCAATCAG AGACCAAGAG GTAATAGATG CATGTAATGA GCATGGAATT
GCTATGATAT TTACAAACAT GAGACACTTT AAACATTAA
 
Protein sequence
MKRALISVSD KTGVLEFAKE LKNLGYEIIS SSGTAKYLKE NGIDVIEVSQ ITGFPEILDG 
RVKTLHPKIH GGILAIRDNQ DHMKQLQEND IKPIDIVAIN LYPFENTVKK GADLDEIIEN
IDIGGPAMVR ASAKNYKYVA IITDPKDYSD IINELKEYGE ISINTKKKLS LKAFRHTAFY
DSIISQVLNE KFEINEDFPE SLTIPMRLKS GLRYGENPHQ KASLYINPLE NGISVADSEI
LQGKEMSFNN YYDVDSAVLL VKEFEEPACV IVKHNNPCGV AVAENIKQAY TFALETDPKS
AFGGIVAFNK EVDEDTAKEL TKLFLEVVVA PSFSDSALEV LKTKKNLRVV KVKNFDKKLE
GKDIKRISGG YLLQDRNLGL YTELKVVTDR QPTEKELEDL IFALKVVKHV KSNAVVIAKD
KRTVGIGVGQ TSRVDSLETA IKKAKEFNLP LEGSVLASEA FFPFRDSIDT AAKEGIKAVI
QPGGSIRDQE VIDACNEHGI AMIFTNMRHF KH