Gene Haur_2074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2074 
Symbol 
ID5733962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2582192 
End bp2583241 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content53% 
IMG OID641279215 
Productcobalamin synthesis protein P47K 
Protein accessionYP_001544842 
Protein GI159898595 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0490287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCGCT GTTTTGAAGG AGTTTCTATG AGCGAAGCAC CAGCCATTCC GATGACGATT 
TTGACTGGCT TTTTAGGTGC AGGGAAAACA ACCGTCTTGA ATCGCTTGTT GCGTGAGGCC
CATGGTCGCA AAATCGCCGT GCTCGTCAAC GATTTTGGCG CGATTAATAT CGATGCGCAG
TTGGTCGTTG GTATTGAACG CAACGACATT GTAAATCTGG CGAATGGCTG TATTTGTTGT
ACGATTCGCG AAGATTTATT GACTGCGACG CTCGCATTGC TTGATCGTGC AGAGCGGCCT
GATGCTATTA TTGTGGAAGC GAGTGGTATC TCTGACCCTC TGGCGATCGC ATGGACCTTC
CGTTCGCCCG CGTTACGCCC GCACATTACC CTTGATGCCA TTGTGGCGGT CGTTGATGCT
GAGCGCATTT ACGAACAACG AGAACAGGTA ATGCAGGTCG TTGATCAAAT TGCTGCTGCC
GATATGGTGG TGATCAATAA AATCGATTTA GTTCCTCCTC TCCACATTCA CGCGGTGATG
ACGTGGATTC AGTCCATCGT ACCTCGTGCA CGCATTGTGG CTGCGGAGTA CGGCGATGTT
CCTGTTCAGG TGCTTCTGGG AAGCGGCATC TATCGTATTG CGTTGCTGCC GAATCAGGAA
GTCCCTGAAC CGCATACGCA TCATCACGAT CACGAATGGC AAACCTGGCA CTATCAAACC
ACGCAACCAT TTCATCTGCG CCGCCTGCAA CATGCCTTGC ACCACTTACC ACCTTCCATT
TTTCGCGCCA AAGGGATTGT CGCTTTAGCC GAAGCACCGG ACCGCCAAGC GATTGTTCAG
GTTGTGGGCA ACCGCGCGAG TGTGCAGCTG AGTACACCTT GGGGGCTAAC CAGCCCCTAC
AGCCAACTCG TGGTGATTGG CCAGCGCAAG CGTTTTGATG TCGTGGCCCT ACGCCAGCAA
TTTCATGCCT GTTTGGCATC AGGTGATCAC GAACTGTGCG ATCAACGCCC AAGCGCCAAT
GCATGGTCGC ACCCAGATCA GGCTCCATGA
 
Protein sequence
MTRCFEGVSM SEAPAIPMTI LTGFLGAGKT TVLNRLLREA HGRKIAVLVN DFGAINIDAQ 
LVVGIERNDI VNLANGCICC TIREDLLTAT LALLDRAERP DAIIVEASGI SDPLAIAWTF
RSPALRPHIT LDAIVAVVDA ERIYEQREQV MQVVDQIAAA DMVVINKIDL VPPLHIHAVM
TWIQSIVPRA RIVAAEYGDV PVQVLLGSGI YRIALLPNQE VPEPHTHHHD HEWQTWHYQT
TQPFHLRRLQ HALHHLPPSI FRAKGIVALA EAPDRQAIVQ VVGNRASVQL STPWGLTSPY
SQLVVIGQRK RFDVVALRQQ FHACLASGDH ELCDQRPSAN AWSHPDQAP