Gene Information |
Locus tag | Haur_2074 |
Symbol | |
ID | 5733962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2582192 |
End bp | 2583241 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279215 |
Product | cobalamin synthesis protein P47K |
Protein accession | YP_001544842 |
Protein GI | 159898595 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
| |
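The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene span includes the stop codon; the record does not state either convention explicitly. The function name `feature_lengths` is illustrative only.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a span that ends with a
    stop codon (both are assumptions, not stated in the record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the record: Start bp 2582192, End bp 2583241.
print(feature_lengths(2582192, 2583241))  # (1050, 349)
```

Under these assumptions the result matches the listed Gene Length (1050 bp) and Protein Length (349 aa).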
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0490287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCGCT GTTTTGAAGG AGTTTCTATG AGCGAAGCAC CAGCCATTCC GATGACGATT TTGACTGGCT TTTTAGGTGC AGGGAAAACA ACCGTCTTGA ATCGCTTGTT GCGTGAGGCC CATGGTCGCA AAATCGCCGT GCTCGTCAAC GATTTTGGCG CGATTAATAT CGATGCGCAG TTGGTCGTTG GTATTGAACG CAACGACATT GTAAATCTGG CGAATGGCTG TATTTGTTGT ACGATTCGCG AAGATTTATT GACTGCGACG CTCGCATTGC TTGATCGTGC AGAGCGGCCT GATGCTATTA TTGTGGAAGC GAGTGGTATC TCTGACCCTC TGGCGATCGC ATGGACCTTC CGTTCGCCCG CGTTACGCCC GCACATTACC CTTGATGCCA TTGTGGCGGT CGTTGATGCT GAGCGCATTT ACGAACAACG AGAACAGGTA ATGCAGGTCG TTGATCAAAT TGCTGCTGCC GATATGGTGG TGATCAATAA AATCGATTTA GTTCCTCCTC TCCACATTCA CGCGGTGATG ACGTGGATTC AGTCCATCGT ACCTCGTGCA CGCATTGTGG CTGCGGAGTA CGGCGATGTT CCTGTTCAGG TGCTTCTGGG AAGCGGCATC TATCGTATTG CGTTGCTGCC GAATCAGGAA GTCCCTGAAC CGCATACGCA TCATCACGAT CACGAATGGC AAACCTGGCA CTATCAAACC ACGCAACCAT TTCATCTGCG CCGCCTGCAA CATGCCTTGC ACCACTTACC ACCTTCCATT TTTCGCGCCA AAGGGATTGT CGCTTTAGCC GAAGCACCGG ACCGCCAAGC GATTGTTCAG GTTGTGGGCA ACCGCGCGAG TGTGCAGCTG AGTACACCTT GGGGGCTAAC CAGCCCCTAC AGCCAACTCG TGGTGATTGG CCAGCGCAAG CGTTTTGATG TCGTGGCCCT ACGCCAGCAA TTTCATGCCT GTTTGGCATC AGGTGATCAC GAACTGTGCG ATCAACGCCC AAGCGCCAAT GCATGGTCGC ACCCAGATCA GGCTCCATGA
|
Protein sequence | MTRCFEGVSM SEAPAIPMTI LTGFLGAGKT TVLNRLLREA HGRKIAVLVN DFGAINIDAQ LVVGIERNDI VNLANGCICC TIREDLLTAT LALLDRAERP DAIIVEASGI SDPLAIAWTF RSPALRPHIT LDAIVAVVDA ERIYEQREQV MQVVDQIAAA DMVVINKIDL VPPLHIHAVM TWIQSIVPRA RIVAAEYGDV PVQVLLGSGI YRIALLPNQE VPEPHTHHHD HEWQTWHYQT TQPFHLRRLQ HALHHLPPSI FRAKGIVALA EAPDRQAIVQ VVGNRASVQL STPWGLTSPY SQLVVIGQRK RFDVVALRQQ FHACLASGDH ELCDQRPSAN AWSHPDQAP
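Both sequences are displayed in space-separated blocks of ten characters, so the spaces have to be removed before the sequences are measured or compared with the length and GC content fields. A minimal sketch of that cleanup follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database API.

```python
def clean_sequence(grouped):
    """Strip the display whitespace from a block-formatted sequence."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, as displayed in the record.
prefix = clean_sequence("ATGACGCGCT GTTTTGAAGG AGTTTCTATG")
print(len(prefix))          # 30
print(gc_fraction(prefix))  # GC fraction of this prefix only
```

Applied to the full gene sequence, `len(clean_sequence(...))` should agree with the Gene Length field, and `gc_fraction` can be compared against the rounded GC content value.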