Gene Haur_3408 details

Gene Information

Locus tag: Haur_3408
Symbol:
ID: 5735269
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4294032
End bp: 4295414
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 50%
IMG OID: 641280555
Product: VWA containing CoxE family protein
Protein accession: YP_001546172
Protein GI: 159899925
COG category: [R] General function prediction only
COG ID: [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
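
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(4294032, 4295414)
print(length)                  # 1383
print(protein_length(length))  # 460
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields.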


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCAAT CCGAGCTATT AAATCGTCGC CAAGTGCTCT ATTGGCGTAT GCTCAGCACT 
ATGTTTGGCT ATGACCAGCA GGGCGAAAAT TTCGACAGCA TGAGCCATGA GATTGCCCAA
GATCTGGCCT TGCCTGAGTC GATTTTGCAC CCAACACTCT CGCTGGAGCA ATTGTTTCAA
CGTTACCCTG AGCTTGAGCC GGAGTTCAAT CTGACTGAGC TTGACGATCG CCAAGATCCT
ACGACTTTGC GCCGTTCGTT AATTATTTCG AAGTTGTTGC TGAATGTCTT TGGCCCTCAA
ACCCAAAAAC GCTCGATCAG CGCTGCCGAG TATGCCCAAT GGCTCAAAGA TGTGGCCCAT
CTTGAACGTT GTTTGGGTTT TCAGCCTGGA GCGTTGCGTC AAAGCCAACC TGGTCAAGGC
CAAGCTAGCC AACCTGGTGG TTTGCAAGGT GGGCAGGGCG TTGGCTCAGG CTTCAATCTC
TCCGAAGAAG AGTTGCGCCA AGTTATCCAA GGGCTGGAAA AAGATTTGAT CAAGCGCATG
GCTTTGCGCG AAGTGCTGCA AGATAATCGG CTTGCCGCCC AACTTACGCC TTCGATGGCG
GTGGTTGAGC AATTGCTGCG CGATAAAAGC CATCTTTCGG GCAATGCCTT GATTAACGCC
AAACGCCTGA TCAAGCAATA TGTTGATGAA TTGGCCGATG TGTTGCGTTT GCAGGTGATG
CAAGCCGTTT CAGCCAAAAT CGATCGTTCA GTGCCACCTA AGCGGGTGTT TCGCAACCTC
GATTTGAAAC GCACAATTTG GCGCAATCTG ACCAATTGGA ATTCCAATGA AGGCCGTTTG
TATGTTGATC GCTTGTATTA TCGTCAAACT GCCAAAAAAC GCACCCCAAT GCGCATGATC
GTGGTCGTCG ATCAATCTGG CTCGATGGTT GATGCCATGG TGCAATGCAC AATTCTGGCT
TCGATTTTTG CCGGTTTGCC CCATGTTGAT ATGCATTTGA TCGCCTTCGA CACGCGCATG
CTTGATCTCA CGCCTTGGGT GCACGACCCG TTTGAGGTAT TGCTGCGCAC TCAGCTTGGC
GGCGGCACAA GCATCAACGA AGCCTTGCTC TTTGCCAGCG AAAAAATTCA AGAGCCACGC
AAAACCGCCG TGGTGCTGAT CACCGATTTT TACGAAGGCG GTTCGGATCA AGTGCTGCTC
GATACAATCA AAGCCATGAT CGAATCGGGT GTGCATTTTA TTCCGGTCGG GGCGGTCACC
AGTTCGGGCT ATTTCAGCGT CAACGATTGG TTCCGTACCA AGCTCAAAGA AATGGGTCGG
CCAATTTTTG CTGGCAGCCC TCGCAAGCTG ATCGAACAAA TTAAGCAATT TATTACCTTG
TAA
 
Protein sequence
MNQSELLNRR QVLYWRMLST MFGYDQQGEN FDSMSHEIAQ DLALPESILH PTLSLEQLFQ 
RYPELEPEFN LTELDDRQDP TTLRRSLIIS KLLLNVFGPQ TQKRSISAAE YAQWLKDVAH
LERCLGFQPG ALRQSQPGQG QASQPGGLQG GQGVGSGFNL SEEELRQVIQ GLEKDLIKRM
ALREVLQDNR LAAQLTPSMA VVEQLLRDKS HLSGNALINA KRLIKQYVDE LADVLRLQVM
QAVSAKIDRS VPPKRVFRNL DLKRTIWRNL TNWNSNEGRL YVDRLYYRQT AKKRTPMRMI
VVVDQSGSMV DAMVQCTILA SIFAGLPHVD MHLIAFDTRM LDLTPWVHDP FEVLLRTQLG
GGTSINEALL FASEKIQEPR KTAVVLITDF YEGGSDQVLL DTIKAMIESG VHFIPVGAVT
SSGYFSVNDW FRTKLKEMGR PIFAGSPRKL IEQIKQFITL
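
The relationship between the two sequences above can be illustrated by translating a few codons. This is a minimal sketch, not the database's own pipeline: `translate` is a hypothetical helper, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ only in the permitted start codons). Whitespace is stripped first because the sequences are printed in blocks of ten.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling through T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons and last five codons of the gene sequence above.
print(translate("ATGAACCAAT CCGAGCTATT A"))  # MNQSELL
print(translate("TTTATTACCT TGTAA"))         # FITL
```

The two outputs match the first seven and last four residues of the protein sequence.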