Gene Information |
Locus tag | Hhal_2000 |
Symbol | |
ID | 4710417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2204773 |
End bp | 2206356 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639856473 |
Product | phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001003566 |
Protein GI | 121998779 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
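The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
START_BP = 2204773
END_BP = 2206356
GENE_LENGTH_BP = 1584
PROTEIN_LENGTH_AA = 527


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1584
print(coding_codons(GENE_LENGTH_BP))   # 527
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.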
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00113973 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACCA ACGACGGCGT GCGGCCCCTG CGGCGGGCCC TGATCAGCGT TTCCGATAAG AGCGGGGTGG AGGGCTTTGC CCGCGCCCTG CATGAGCAAG GCGTCGAGAT CCTCTCGACC GGTGGTACGG CCCGCCTGCT GGGTGAGGCC GGGATCCCGG TGCGGGAGGT CTCGGCCGAG ACCGGCTTCC CGGAGATCAT GGACGGTCGC GTCAAGACCC TGCATCCGCG CATCCACGGC GGGCTGCTGG GCCGGCGCGG CACGGATGAC GCGGTCATGG ACGAGCACGG CATTGGTCCC ATCGATCTGC TTTGCGTCAA CCTCTACCCC TTCGAGCAGG CCGTGGCCGC CGAGGGCTGC ACCTTGACCG ACGCCATCGA GAATATCGAT GTCGGCGGGC CGGCGATGAT CCGTGCGGCC GCCAAGAACC ACGCCGACGT GGCGGTGGTG ACCGAGTCGT CGGCCTATGG CCTGGTCCTC GATGAGCTCC AGCGCCTTGG CGGGACCAGC CGCGCCCTGC GCCACCATCT GGCGACCCGA GCGTTCAGCC ACACCGCGCG CTACGACGGC GCCATCGCTG CCTACCTGAG CCAGCGCGAC GAACAGGGCG AGCAGCAGGG CGATTTCCCG GCGATCTGGA CGCTCCAGGT GGAGAAGGTC GCCGACATGC GTTACGGCGA GAACCCCCAT CAATCCGCTG CCTTCTATCG CGATGTCGCT CCCGGCGAGG CCAGCGTGTC CACCGCCCGC CAGCTCCAGG GTAAGGCCCT GTCGTACAAC AACGTGGCCG ACACCGACGC CGCCCTGGAG TGCGTCAAGG GCTTCCAGAC GCCGGCCTGC GTCATCGTCA AGCACGCCAA TCCCTGCGGC GTGGCCTGCT CGGGGACGTT GCGGGAGGCC TACGACCGGG CCTTCGAGGT CGATCCGACC TCCGCCTTCG GCGGCATCAT CGCCTTCAAC GATACCGTCG ACGCCGAGCT GGCCGGCGCC ATCCTCGATC GCCAGTTCGT CGAGGTGGTC ATCGCCCCGG AGGTCAGTGA CGAAGCCCTG TCGCGCTTCG CCGCCAAGGC CAACGTGCGA GTCCTGCAGA CCGGCCGCTG GCCGCAGCAT CCGGGCGCGG ATCTGGAGCT CAAGCGGGTG CGTGGCGGCC TCCTGGTGCA GGACCGGGAC ACCGCGGTGG TTGATCCGGC CGACCTGCGG GTGGTCACCA AGCGCCAGCC CACCGATGCC GAGTGGGCCG ACCTGCGCTT CGCCTGGGAG GTGGTGCGGC ACGTGAAGTC CAACGCCATC GTTTTCGCCG GCGGGCAGCG CACCCTCGGC GTGGGGGCCG GGCAGATGAG CCGTGTCTTC AGTACCCGTA TTGCCTGCGA GAAGGCGGCC GATGCGGGCC TGGCGCTGCA GGGCTCGGTC CTGGCCTCTG ACGCCTTCTT CCCGTTCCGC GACGGCGTCG ATCAGGCTGC CGAGGCCGGC GCCGCCGCCG TGATCCAGCC CGGTGGCTCG ATGCGGGATC AGGAGGTCAT CGATGCCGCC GACGAGCACG GTCTGGCCAT GGTCTTCACC GGGATGCGCC ACTTCCGCCA CTGA
|
Protein sequence | MATNDGVRPL RRALISVSDK SGVEGFARAL HEQGVEILST GGTARLLGEA GIPVREVSAE TGFPEIMDGR VKTLHPRIHG GLLGRRGTDD AVMDEHGIGP IDLLCVNLYP FEQAVAAEGC TLTDAIENID VGGPAMIRAA AKNHADVAVV TESSAYGLVL DELQRLGGTS RALRHHLATR AFSHTARYDG AIAAYLSQRD EQGEQQGDFP AIWTLQVEKV ADMRYGENPH QSAAFYRDVA PGEASVSTAR QLQGKALSYN NVADTDAALE CVKGFQTPAC VIVKHANPCG VACSGTLREA YDRAFEVDPT SAFGGIIAFN DTVDAELAGA ILDRQFVEVV IAPEVSDEAL SRFAAKANVR VLQTGRWPQH PGADLELKRV RGGLLVQDRD TAVVDPADLR VVTKRQPTDA EWADLRFAWE VVRHVKSNAI VFAGGQRTLG VGAGQMSRVF STRIACEKAA DAGLALQGSV LASDAFFPFR DGVDQAAEAG AAAVIQPGGS MRDQEVIDAA DEHGLAMVFT GMRHFRH
|
| |
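The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It uses the amino-acid assignments of the standard genetic code, which NCBI translation table 11 shares (table 11 differs in its permitted start codons). Although the record lists the gene on the minus strand, the gene sequence shown begins with ATG and reads in the coding direction, so no reverse complement is applied here. The helper names and the two excerpts (the first 30 and last 24 bases of the gene sequence) are chosen for illustration only.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11
# uses the same amino-acid assignments). '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring the 10-base spacing."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Excerpts copied from the Gene sequence field above.
FIRST_30 = "ATGGCGACCA ACGACGGCGT GCGGCCCCTG"
LAST_24 = "GGGATGCGCC ACTTCCGCCA CTGA"

print(translate(FIRST_30))  # MATNDGVRPL
print(translate(LAST_24))   # GMRHFRH*
```

The two outputs match the first ten and last seven residues of the protein sequence, with the trailing `*` corresponding to the TGA stop codon.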