Gene Haur_2062 details

Gene Information

Locus tag: Haur_2062
Symbol:
ID: 5733950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2573262
End bp: 2574455
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 52%
IMG OID: 641279204
Product: cobalamin synthesis protein P47K
Protein accession: YP_001544831
Protein GI: 159898584
COG category: [R] General function prediction only
COG ID: [COG0523] Putative GTPases (G3E family)
TIGRFAM ID:
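
The start, end, and length values above agree with each other if the coordinates are read as 1-based and inclusive, and if the coding sequence ends in a single stop codon. Both conventions are assumptions here, since the record does not state them. A minimal sketch of the arithmetic, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2573262, 2574455)
print(length_bp, protein_length(length_bp))  # 1194 397
```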


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.410151
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGAAAC TTCCCGTTAC TGTGCTTTCT GGGTTTCTTG GTGCCGGTAA GACAACCGTC 
TTGAATCATG TCCTAACCAA TCGTGCAGGC CTCCGCGTCG CCGTGATTGT AAACGATATG
AGTGAGATCA ATATTGATGC CCAATTGATC ACCGAAGGGA CGGCCCAGCT CAGTCGCACG
AAGGAAGCCT TGGTCGAACT GTCAAACGGC TGTATTTGTT GTACCCTACG CGATGATCTG
CTACGCGAAG TCGCTCGTTT GGCACGCGAT GGGCGCTTCG ATTATCTGTT GATTGAATCA
ACTGGCATTT CAGAGCCATT ACCGGTCGCG ATGACCTTTA GCTTTGAGAC CCCTGATGGG
ATTGACCGCC TCGTCGATAT CGCTCAGTTG GATACCATGG TAACGGTTGT TGATGCCCAT
ACATGGCTTG CTGACTATCG TGCTGGTCAG GCGTTGCATA CGTTGGATAT GGGAATCAGT
CCTGCTGATC ACCGCACCAT TGCCGATCTG CTTATTGACC AAGTTGAGTT TGCCAATGTT
ATTGTGCTGA ACAAGATCGA TTTAGTTGAT ACGCGGCAAC TTCATGAACT CGAAGGGGTC
TTGCATACCT TGAACCCGGA TGCGCGAGTT CTGCATGCAA CCAATGGTGT CATTGAACCA
ACAGCGATTC TCCATACCGG ATTATTTGAC ATGGAGCGTG CGCAACAATC CGCTGGCTGG
ATCAAAGAGC TGAATGGTGA GCATACCCCC GAAACCGAAG CGTATGGAAT TGGCAGTGTT
GTGTTTCGCG CACGGCGACC GTTTCATCCG CAACGGCTAT TGAGCGTGCT CACCGGACCA
GAACTCCAAC CCGTCCTGCG TTCGAAAGGG GTCTTGTGGC TTGCGTCACG CCATGATCAC
GGGTTGCGGT GGTCGTTAGC GGGGAAGATT GCCCGGGTTT CAGACAGTGG TGCGTGGCTT
GCCGCGACTC CTGACGATAC GTGGCCACAA AACGATCAAG TAGGCATATA CATCGAACGG
TATTGGCAAG AACCGTTTGG TGATCGCCGC CAAGAGTTAG TGTTTATTGG CATTGATATG
CCACATGAGC AATTGGTCGC AAAACTCGAA CACGCGTTAT TAACCGACCA AGAACTTGCG
GCTGGCCCGC CGCTGTGGAA GCGATTTGAA GATCTGTTTC CGCATTTTAA CTGA
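
The GC content listed in the gene information is the percentage of G and C bases in this sequence. A minimal sketch of that calculation follows, demonstrated on the first ten bases only. The helper name gc_content is hypothetical, and whitespace is stripped because the sequence is printed in blocks of ten.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First block of the gene sequence: 3 G + 2 C out of 10 bases.
print(gc_content("ATGGCGAAAC"))  # 50.0
```

The same function can be applied to the full sequence for comparison with the listed value.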
 
Protein sequence
MAKLPVTVLS GFLGAGKTTV LNHVLTNRAG LRVAVIVNDM SEINIDAQLI TEGTAQLSRT 
KEALVELSNG CICCTLRDDL LREVARLARD GRFDYLLIES TGISEPLPVA MTFSFETPDG
IDRLVDIAQL DTMVTVVDAH TWLADYRAGQ ALHTLDMGIS PADHRTIADL LIDQVEFANV
IVLNKIDLVD TRQLHELEGV LHTLNPDARV LHATNGVIEP TAILHTGLFD MERAQQSAGW
IKELNGEHTP ETEAYGIGSV VFRARRPFHP QRLLSVLTGP ELQPVLRSKG VLWLASRHDH
GLRWSLAGKI ARVSDSGAWL AATPDDTWPQ NDQVGIYIER YWQEPFGDRR QELVFIGIDM
PHEQLVAKLE HALLTDQELA AGPPLWKRFE DLFPHFN
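
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below assumes that table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The name translate is illustrative, and only the first and last blocks of the gene sequence are checked.

```python
from itertools import product

# Codons enumerated in T, C, A, G order at each position, paired with the
# standard amino acid string ('*' marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate in frame 1 up to the first stop codon; whitespace is ignored.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCGAAAC TTCCCGTTAC TGTGCTTTCT"))  # MAKLPVTVLS
print(translate("GATCTGTTTC CGCATTTTAA CTGA"))        # DLFPHFN
```

The two outputs match the first ten and last seven residues of the protein sequence above.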