Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0639 |
Symbol | |
ID | 5732537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 736679 |
End bp | 737887 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641277766 |
Product | cobalamin synthesis protein P47K |
Protein accession | YP_001543415 |
Protein GI | 159897168 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000744918 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGAAGT TGCCGCTTCC CGTAACTGTA CTTTCTGGTT TTCTTGGTGC AGGGAAAACC ACCTTATTAA ATCATATTTT GGCCAACCGC GAGGGCTTGC GGGTCGCCGT CATCGTCAAT GATATGAGCG AAGTTAATAT CGATGCGCGG TTGGTTGGTC AAGGCGAACT TGCACTCAAT CGGGTCGAAG AACAATTGAT CGAGCTCAGC AATGGTTGTA TTTGCTGTAC CTTGCGTGAA GATTTGTTGC TTGAGGTTAG TAAGCTAGCT CGTGCTGGCC GATTCGATTA TCTGCTGATC GAATCGACTG GTATTTCGGA GCCGTTGCCC GTCGCCGAAA CCTTTACTTT CGAGGATGAA TCGGGCCTCA GTTTGGGCGA AATTGCCAAG CTCGATACGA TGGTAACCGT GGTTGATGCA CTCAATTTCT TACACGATTT CAATGCTGCT CACGATTTAC GCGATCGTGA TTTGGCGATC GACGAGGCCG ATGAGCGGAC TTTGGTCGAT TTATTGATCG ATCAGATTGA GTTTTGCAAT GTGTTAATCA TTAATAAAAC TGATTTGGTT AGTGCCGAAC AATTAATCCA TTTGCATGAA TTGCTAAATA AACTCAATCC GCAAGCCAAG ATTTTTCATG CACAACATGG TCAAGTGCCA CTCAACGAAA TTCTGAATAC AGGCTTGTTT GATTTTGATC AAGCAAGCGC TGCACCTGGT TGGTTGGCCG AATTGCGCGG CGAACATACG CCCGAAACCG AGGAATATGG CATTCGCAGC TGGGTATATC GCGCTCGACG GCCATTTGTC GCTCAGCGTT TTTGGGATTT TGTCAATAAT GATTGGCCTG GCGTAATTCG CGCCAAAGGC TTCTTTTGGG TAATTTCGCA GCCCGAGACG GCTGGCTTGC TTTCGCAGGC GGGCCAAAAT TGCCGGGTCG AGCCAGCTGG ACAATGGTGG GCTGATAGCG ATCAGAGCGA ATGGCCGGAA ACTGCCGAGG AACGCACCGA AATTGAAGCC TTGTGGGATG AGCAGGTCGG CGACCGTCGG CAGGAACTCG TCTTTATCGG CCAAGATTTT GATCAACAAT GGTTACAGCA AACGTTAGAT GCCTGTTTGG TTAGCGATCA CGAATGGCAG CAACCTGCCG CAACGTGGCA AATCGATGAT CCGTTTGCTG ATTGGAATGA GTACGAAGCG ATAGCTTAG
|
Protein sequence | MSKLPLPVTV LSGFLGAGKT TLLNHILANR EGLRVAVIVN DMSEVNIDAR LVGQGELALN RVEEQLIELS NGCICCTLRE DLLLEVSKLA RAGRFDYLLI ESTGISEPLP VAETFTFEDE SGLSLGEIAK LDTMVTVVDA LNFLHDFNAA HDLRDRDLAI DEADERTLVD LLIDQIEFCN VLIINKTDLV SAEQLIHLHE LLNKLNPQAK IFHAQHGQVP LNEILNTGLF DFDQASAAPG WLAELRGEHT PETEEYGIRS WVYRARRPFV AQRFWDFVNN DWPGVIRAKG FFWVISQPET AGLLSQAGQN CRVEPAGQWW ADSDQSEWPE TAEERTEIEA LWDEQVGDRR QELVFIGQDF DQQWLQQTLD ACLVSDHEWQ QPAATWQIDD PFADWNEYEA IA
|
| |