Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2062 |
Symbol | |
ID | 5733950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2573262 |
End bp | 2574455 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279204 |
Product | cobalamin synthesis protein P47K |
Protein accession | YP_001544831 |
Protein GI | 159898584 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.410151 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAAAC TTCCCGTTAC TGTGCTTTCT GGGTTTCTTG GTGCCGGTAA GACAACCGTC TTGAATCATG TCCTAACCAA TCGTGCAGGC CTCCGCGTCG CCGTGATTGT AAACGATATG AGTGAGATCA ATATTGATGC CCAATTGATC ACCGAAGGGA CGGCCCAGCT CAGTCGCACG AAGGAAGCCT TGGTCGAACT GTCAAACGGC TGTATTTGTT GTACCCTACG CGATGATCTG CTACGCGAAG TCGCTCGTTT GGCACGCGAT GGGCGCTTCG ATTATCTGTT GATTGAATCA ACTGGCATTT CAGAGCCATT ACCGGTCGCG ATGACCTTTA GCTTTGAGAC CCCTGATGGG ATTGACCGCC TCGTCGATAT CGCTCAGTTG GATACCATGG TAACGGTTGT TGATGCCCAT ACATGGCTTG CTGACTATCG TGCTGGTCAG GCGTTGCATA CGTTGGATAT GGGAATCAGT CCTGCTGATC ACCGCACCAT TGCCGATCTG CTTATTGACC AAGTTGAGTT TGCCAATGTT ATTGTGCTGA ACAAGATCGA TTTAGTTGAT ACGCGGCAAC TTCATGAACT CGAAGGGGTC TTGCATACCT TGAACCCGGA TGCGCGAGTT CTGCATGCAA CCAATGGTGT CATTGAACCA ACAGCGATTC TCCATACCGG ATTATTTGAC ATGGAGCGTG CGCAACAATC CGCTGGCTGG ATCAAAGAGC TGAATGGTGA GCATACCCCC GAAACCGAAG CGTATGGAAT TGGCAGTGTT GTGTTTCGCG CACGGCGACC GTTTCATCCG CAACGGCTAT TGAGCGTGCT CACCGGACCA GAACTCCAAC CCGTCCTGCG TTCGAAAGGG GTCTTGTGGC TTGCGTCACG CCATGATCAC GGGTTGCGGT GGTCGTTAGC GGGGAAGATT GCCCGGGTTT CAGACAGTGG TGCGTGGCTT GCCGCGACTC CTGACGATAC GTGGCCACAA AACGATCAAG TAGGCATATA CATCGAACGG TATTGGCAAG AACCGTTTGG TGATCGCCGC CAAGAGTTAG TGTTTATTGG CATTGATATG CCACATGAGC AATTGGTCGC AAAACTCGAA CACGCGTTAT TAACCGACCA AGAACTTGCG GCTGGCCCGC CGCTGTGGAA GCGATTTGAA GATCTGTTTC CGCATTTTAA CTGA
|
Protein sequence | MAKLPVTVLS GFLGAGKTTV LNHVLTNRAG LRVAVIVNDM SEINIDAQLI TEGTAQLSRT KEALVELSNG CICCTLRDDL LREVARLARD GRFDYLLIES TGISEPLPVA MTFSFETPDG IDRLVDIAQL DTMVTVVDAH TWLADYRAGQ ALHTLDMGIS PADHRTIADL LIDQVEFANV IVLNKIDLVD TRQLHELEGV LHTLNPDARV LHATNGVIEP TAILHTGLFD MERAQQSAGW IKELNGEHTP ETEAYGIGSV VFRARRPFHP QRLLSVLTGP ELQPVLRSKG VLWLASRHDH GLRWSLAGKI ARVSDSGAWL AATPDDTWPQ NDQVGIYIER YWQEPFGDRR QELVFIGIDM PHEQLVAKLE HALLTDQELA AGPPLWKRFE DLFPHFN
|
| |