Gene Hhal_2000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2000 
Symbol 
ID4710417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2204773 
End bp2206356 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content69% 
IMG OID639856473 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001003566 
Protein GI121998779 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00113973 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACCA ACGACGGCGT GCGGCCCCTG CGGCGGGCCC TGATCAGCGT TTCCGATAAG 
AGCGGGGTGG AGGGCTTTGC CCGCGCCCTG CATGAGCAAG GCGTCGAGAT CCTCTCGACC
GGTGGTACGG CCCGCCTGCT GGGTGAGGCC GGGATCCCGG TGCGGGAGGT CTCGGCCGAG
ACCGGCTTCC CGGAGATCAT GGACGGTCGC GTCAAGACCC TGCATCCGCG CATCCACGGC
GGGCTGCTGG GCCGGCGCGG CACGGATGAC GCGGTCATGG ACGAGCACGG CATTGGTCCC
ATCGATCTGC TTTGCGTCAA CCTCTACCCC TTCGAGCAGG CCGTGGCCGC CGAGGGCTGC
ACCTTGACCG ACGCCATCGA GAATATCGAT GTCGGCGGGC CGGCGATGAT CCGTGCGGCC
GCCAAGAACC ACGCCGACGT GGCGGTGGTG ACCGAGTCGT CGGCCTATGG CCTGGTCCTC
GATGAGCTCC AGCGCCTTGG CGGGACCAGC CGCGCCCTGC GCCACCATCT GGCGACCCGA
GCGTTCAGCC ACACCGCGCG CTACGACGGC GCCATCGCTG CCTACCTGAG CCAGCGCGAC
GAACAGGGCG AGCAGCAGGG CGATTTCCCG GCGATCTGGA CGCTCCAGGT GGAGAAGGTC
GCCGACATGC GTTACGGCGA GAACCCCCAT CAATCCGCTG CCTTCTATCG CGATGTCGCT
CCCGGCGAGG CCAGCGTGTC CACCGCCCGC CAGCTCCAGG GTAAGGCCCT GTCGTACAAC
AACGTGGCCG ACACCGACGC CGCCCTGGAG TGCGTCAAGG GCTTCCAGAC GCCGGCCTGC
GTCATCGTCA AGCACGCCAA TCCCTGCGGC GTGGCCTGCT CGGGGACGTT GCGGGAGGCC
TACGACCGGG CCTTCGAGGT CGATCCGACC TCCGCCTTCG GCGGCATCAT CGCCTTCAAC
GATACCGTCG ACGCCGAGCT GGCCGGCGCC ATCCTCGATC GCCAGTTCGT CGAGGTGGTC
ATCGCCCCGG AGGTCAGTGA CGAAGCCCTG TCGCGCTTCG CCGCCAAGGC CAACGTGCGA
GTCCTGCAGA CCGGCCGCTG GCCGCAGCAT CCGGGCGCGG ATCTGGAGCT CAAGCGGGTG
CGTGGCGGCC TCCTGGTGCA GGACCGGGAC ACCGCGGTGG TTGATCCGGC CGACCTGCGG
GTGGTCACCA AGCGCCAGCC CACCGATGCC GAGTGGGCCG ACCTGCGCTT CGCCTGGGAG
GTGGTGCGGC ACGTGAAGTC CAACGCCATC GTTTTCGCCG GCGGGCAGCG CACCCTCGGC
GTGGGGGCCG GGCAGATGAG CCGTGTCTTC AGTACCCGTA TTGCCTGCGA GAAGGCGGCC
GATGCGGGCC TGGCGCTGCA GGGCTCGGTC CTGGCCTCTG ACGCCTTCTT CCCGTTCCGC
GACGGCGTCG ATCAGGCTGC CGAGGCCGGC GCCGCCGCCG TGATCCAGCC CGGTGGCTCG
ATGCGGGATC AGGAGGTCAT CGATGCCGCC GACGAGCACG GTCTGGCCAT GGTCTTCACC
GGGATGCGCC ACTTCCGCCA CTGA
 
Protein sequence
MATNDGVRPL RRALISVSDK SGVEGFARAL HEQGVEILST GGTARLLGEA GIPVREVSAE 
TGFPEIMDGR VKTLHPRIHG GLLGRRGTDD AVMDEHGIGP IDLLCVNLYP FEQAVAAEGC
TLTDAIENID VGGPAMIRAA AKNHADVAVV TESSAYGLVL DELQRLGGTS RALRHHLATR
AFSHTARYDG AIAAYLSQRD EQGEQQGDFP AIWTLQVEKV ADMRYGENPH QSAAFYRDVA
PGEASVSTAR QLQGKALSYN NVADTDAALE CVKGFQTPAC VIVKHANPCG VACSGTLREA
YDRAFEVDPT SAFGGIIAFN DTVDAELAGA ILDRQFVEVV IAPEVSDEAL SRFAAKANVR
VLQTGRWPQH PGADLELKRV RGGLLVQDRD TAVVDPADLR VVTKRQPTDA EWADLRFAWE
VVRHVKSNAI VFAGGQRTLG VGAGQMSRVF STRIACEKAA DAGLALQGSV LASDAFFPFR
DGVDQAAEAG AAAVIQPGGS MRDQEVIDAA DEHGLAMVFT GMRHFRH