Gene Information |
Locus tag | Haur_4200 |
Symbol | |
ID | 5736062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5355563 |
End bp | 5356573 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281355 |
Product | cobalamin synthesis protein P47K |
Protein accession | YP_001546960 |
Protein GI | 159900713 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
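The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Sketch: sanity-check the length fields of the record above.
# Assumptions: 1-based inclusive coordinates, one terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5355563, 5356573)
print(length, protein_length(length))  # 1011 336
```

Under these assumptions the result agrees with the Gene Length (1011 bp) and Protein Length (336 aa) rows.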
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000951795 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACGC CGATTCCAAT GACGATTTTG ACCGGATTTT TGGGCGCAGG CAAAACGACG TTGCTCAATC GGCTGCTTAG TGCCCAGCAT GGCCTCAAAA TCGCGGTGCT GGTTAACGAT TTTGGCGAGA TTAATATTGA CTCGCAGTTG GTAGTTGGGG TTGAAAATGA CGCTGTAATT AATTTGGCGA ATGGCTGTAT TTGCTGCACC ATTCGCGAGG ATTTACTCAC CACCACCCTC GAATTGCTTG AGCGCGATGA TCGACCTGAA TATATCATCG TCGAAGCCAG CGGCGTTTCC GACCCGGTTT CGGTGGCATT AACCTTCCGT TTGCCCGCCC TGCGCTCGTT GATCAATCTC GATTCGATTG TGGCGGTCGT TGATGCTGAG AGTATTCACC AACAACGTGA ACAATTGATT CAAGTCGTCG ATCAAATTGC CGCCGCCGAT CTTGTTGTGA TCAATAAAAT CGATTTGGTT GATGCTGCCC AACAGCAACG GGTGATTGCC TGGATTCAGA CGATTGTGCC ACGGGCGCGA ATTTTGACCG CTGAATATGG CGAGGTTCCG GTTGATTTGC TGCTGGGAGT TGGTCAATAT CGCATCGATT TGCAGGCTGA AGCGTATCCC ACCCAGCATC AACATAACGA AGAATGGCAA ACCTGGAATT ACCAAACTGA TCAGCCTTTT ACCATGAGCA GCCTGCAACG AGCCTTCCAA CAATTGCCAA CCGCGATTTT TCGTGCTAAA GGCATTGTAT ATTTGGCCGA AGCACCTGAA CGCCGCGCAA TTGTTCAGTT GGCGGGCAAA CGTACTAGTT TGCGGCTCAG TGAGCCATGG GGCGCAGCCA CTCCGTACAG CCAAATTGTG ATAATTGGCC GGAGCAATAG CTTTGATCCA GCTGAATTGA CCCACCATTT TAATGCTTGT TTGGCAGATG CCACGCAAGA ACCACGCGAA GAAATTCTGA CTGTGGCCGA ATGGCGACGC AAATACCAAG CCCAATCGTA A
|
Protein sequence | MTTPIPMTIL TGFLGAGKTT LLNRLLSAQH GLKIAVLVND FGEINIDSQL VVGVENDAVI NLANGCICCT IREDLLTTTL ELLERDDRPE YIIVEASGVS DPVSVALTFR LPALRSLINL DSIVAVVDAE SIHQQREQLI QVVDQIAAAD LVVINKIDLV DAAQQQRVIA WIQTIVPRAR ILTAEYGEVP VDLLLGVGQY RIDLQAEAYP TQHQHNEEWQ TWNYQTDQPF TMSSLQRAFQ QLPTAIFRAK GIVYLAEAPE RRAIVQLAGK RTSLRLSEPW GAATPYSQIV IIGRSNSFDP AELTHHFNAC LADATQEPRE EILTVAEWRR KYQAQS
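The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips that spacing, translates the first three blocks of the gene sequence, and computes GC fraction. It relies on the fact that translation table 11 assigns amino acids to codons in the same way as the standard code; the alternative start codons that table 11 permits are not handled, which does not matter here because this gene starts with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are illustrative assumptions.

```python
from itertools import product

# Codon assignments in NCBI order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = clean("ATGACCACGC CGATTCCAAT GACGATTTTG")
print(len(prefix), translate(prefix))  # 30 MTTPIPMTIL
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence through `clean` and `translate` in the same way should reproduce the full protein sequence, and `gc_fraction` on the cleaned gene sequence can be compared against the GC content row.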