Gene Haur_1060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1060 
Symbol 
ID5732964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1212137 
End bp1213168 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content50% 
IMG OID641278195 
Productcob(I)alamin adenosyltransferase 
Protein accessionYP_001543836 
Protein GI159897589 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG2109] ATP:corrinoid adenosyltransferase
[COG3411] Ferredoxin 
TIGRFAM ID[TIGR00708] cob(I)alamin adenosyltransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATG CCCCACTGAC GATGCGAGCC TATGGCCGCC ATTTATTTAT TTGTACTGGT 
GATCAATGTA ATGGCGACGC TGACGGCGCG GCCTTAGCCC AACATGCGCT TGAGCAACTC
GGCGATCGGC GCAAATTACG CAATCCCGAA CGGGTCAAAT GCTCTACGGT CAGTTGTTTG
GGTGTTTGCC AACGCGGCCC GATTGCGGTG GTTTATCCCG AAGGCATTTG GTATCAGCAG
CTTGATCACG ATTTAGTTGA ACAAATTGTG CAAGAGCATT TGATCAATGG CGAGCCAGTT
GAGGCAGCGA TTTTCCATCG GCTGTATCCA GCGGGCCAAG AGCCAGAATA TGCCCCAGCA
GTGCGCGGCG ATCAAGCATT TAATCCAATT GAATTACAAC CCGATGATGC GGTTGAGCAA
TTGGAACCAA TCGTCGTTGC TGGCACGCAA CTGCATTCAA CCGAAGAAAA ACAGCGCTAT
CGCGAACAAG TACGCAAATT GCGCAAAGGC AAAAAAGGCC TTGTGATCGT CAATACCGGT
AATGGCAAGG GCAAAACGTC AGCGGCATTG GGTGTCATGA CTAGGGCGTG GGGTCGCGAT
CTCAAGGTTA AAGTCATTCA ATTTCTCAAA CACGAAAATG CCAAGTTTGG CGAATCGCGG
GCGGCTGCCA AGATGGAAAT TGAGTTTGGT GGCACTGGCG ATGGCTTCAC GTGGACCTCG
AAAGATCTTG ATGCGACCAA AGCCAAAGCC TTACATGGCT GGGAATTGGC TAAAACCGCG
ATTAGCTCGA ATCAATATCA GATTGTCATT CTCGATGAAT TTACCTATGT GATGGCTTTT
GGTTGGCTCG ATGTCAACGA GGTTGTGGCT TGGTTGGCGG CCAACAAGCC TGAATTATTG
CATGTGATTA TTACAGGCCG CGATGCACCT GCTGCCCTGA TCGAACACGC CGACCTTGTA
ACCGAGATGC GCGAAATTAA ACACCCGTTT ACGACTCAAG GCATTCGTGC CCAGATCGGG
ATTGACTTCT GA
 
Protein sequence
MSDAPLTMRA YGRHLFICTG DQCNGDADGA ALAQHALEQL GDRRKLRNPE RVKCSTVSCL 
GVCQRGPIAV VYPEGIWYQQ LDHDLVEQIV QEHLINGEPV EAAIFHRLYP AGQEPEYAPA
VRGDQAFNPI ELQPDDAVEQ LEPIVVAGTQ LHSTEEKQRY REQVRKLRKG KKGLVIVNTG
NGKGKTSAAL GVMTRAWGRD LKVKVIQFLK HENAKFGESR AAAKMEIEFG GTGDGFTWTS
KDLDATKAKA LHGWELAKTA ISSNQYQIVI LDEFTYVMAF GWLDVNEVVA WLAANKPELL
HVIITGRDAP AALIEHADLV TEMREIKHPF TTQGIRAQIG IDF