Gene Haur_0047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0047 
Symbol 
ID5731919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp62267 
End bp63649 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content51% 
IMG OID641277168 
ProductVWA containing CoxE family protein 
Protein accessionYP_001542827 
Protein GI159896580 
COG category[R] General function prediction only 
COG ID[COG3552] Protein containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGAAC GAATTGTGGC CTTTGTCAAG GCGCTGCGAG CAGCCGGAGT TCGTGTGTCG 
CTCGCCGAAA GCCTTGATAG TTTTAATGCC TTAGAACAGC TTGGCATCAG CGATCGCCAG
CTTTTTCACG ATGCGCTGTT GGCAACCTTG GTCAAAGATG CTAGCCAGCA AGCGCAGTTC
GAGGAGCTAT TTCCGCTTTT TTTTGGCCAT GGCGATGCCC CGATGATCAG CGGAGCCAAT
GGCTTGAGCG GGCTAGATCC CGACGAATTA CAACAATTGC GCCAAGAAAT TGCCCAATTG
CGCGAACGGA TTCGCGAATT AATGCAACGG CTGATGGATG GCCAAAATCT CACGCCTGAA
GAATTGGCGG CACTAGCACG AAGTTCGGGC ATCAATCACA TTCAATCGCT CAACCAACGC
CGCTGGGTCG AACGGCGCAT GGAACAACAA ATGGGCCTGA ATGAATTTAA ACAGGCGCTG
GAAGCTTTGC TCAAACAATT GGAAGAAGGC GGGATGGATG CCGCCGCCTT GGCCGAAATT
ATGCAGCAAA TGCAGGGCAA TGCCCAAGCT ATGCGCGACC AAATTAGCCA ATTCGTTGGG
TCGGGCTTAG CCGAGCGCAT GAGCGACGAC TACAATCCGC AAACTGGTGA TGATCTCCAG
CATCGGCCAT TTGGCTCGCT CTCCGATGCT GATGTGCAGC GTATGCGTCA AGAAGTACGC
CGTTTGGCGG CTTTGTTGCG CTCACGAGCA GCGTTGCGCC AAAAACGCGA TAAAGCGGGT
CAGGTTGATA TCAAACGCAC CATGCGCAAC AATATGCGCT ACGACGGCGT GCCGATGAAA
TTGGAATATC GTAAAAAGCA ACAAAAACCT AAATTGGTAA TTATTTGCGA TATTTCGACC
TCGATGCGAC CTGTGGCTGA ATTTATGCTG CGTATGATTT ACGAACTGCA AGATCAAGTT
AGCAAAACCC ATTCATTTGC TTTTATCGCT AATTTGCACG ACATTACCGA GCAATTGAAT
GATAGCCGCG CCGATATTAG CGTCAATGAT GTGCTCGAAA GTATCCCGCC TGGCTACTAC
AACACCGACC TTGGTCATAG CCTCGATACG TTTTTGCATA GCCACCTTAG TACGGTCGAT
TGGCGTACTA CGGTGATTAT TGTGGGTGAT GGCCGCAATA ATTTCAACAA TCCACGGCTG
GAATCATTGC AAACAATTCG TCGCCATGCC AAGCGCTTAA TCTGGTTTAC TCCCGAAGAT
CGCTGGCAAT GGGGCACTGG CGATAGCGAT ATGCAGCTCT ACGCACCGCT TTGCGACCGT
GTGCATCTCG TGACCAACTT GGCTGAATTA ACGGCAGCGG TTGATCGGCT ATTGGCTAAC
TAG
 
Protein sequence
MHERIVAFVK ALRAAGVRVS LAESLDSFNA LEQLGISDRQ LFHDALLATL VKDASQQAQF 
EELFPLFFGH GDAPMISGAN GLSGLDPDEL QQLRQEIAQL RERIRELMQR LMDGQNLTPE
ELAALARSSG INHIQSLNQR RWVERRMEQQ MGLNEFKQAL EALLKQLEEG GMDAAALAEI
MQQMQGNAQA MRDQISQFVG SGLAERMSDD YNPQTGDDLQ HRPFGSLSDA DVQRMRQEVR
RLAALLRSRA ALRQKRDKAG QVDIKRTMRN NMRYDGVPMK LEYRKKQQKP KLVIICDIST
SMRPVAEFML RMIYELQDQV SKTHSFAFIA NLHDITEQLN DSRADISVND VLESIPPGYY
NTDLGHSLDT FLHSHLSTVD WRTTVIIVGD GRNNFNNPRL ESLQTIRRHA KRLIWFTPED
RWQWGTGDSD MQLYAPLCDR VHLVTNLAEL TAAVDRLLAN