Gene Haur_4210 details


Gene Information

Locus tag: Haur_4210
Symbol:
ID: 5736922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5363894
End bp: 5365159
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 52%
IMG OID: 641281365
Product: von Willebrand factor type A
Protein accession: YP_001546970
Protein GI: 159900723
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
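
The listed coordinates and lengths can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(5363894, 5365159)
print(length, protein_length_aa(length))  # prints: 1266 421
```

Both values agree with the Gene Length and Protein Length fields above.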


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.466753
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGGTG AAGTTCAATT AACTGGTACG TTGGCTCGAC CGGCGTTGCC AGCCTTGCAA 
ACCCAGCAGG TTGTTTATTT ACTGCTGGAT ATTACGGCAA CACCAGCGGT TGCTCACGTT
CAAATGCCAG TTAATGTGAG CTTCGTGCTC GATCATAGTG GCTCGATGAA GGGCGACAAA
ATGCGCTGTG TGCGCGAAGC TACCCAACGC GCTCTGGGCT TGATGGGGCC GCAAGATATT
GTTTCGGTGG TGATTTTCGA CCATCGCCGC GAAACGATTA TCAGTGCTCA GCCTGTTCGC
AACGTTGCTG CCTTACAAGC TGAAGTTGGT AAAATCAAAG ATGCAGGTGG TACAAAAATC
GCACCTGCGC TCGAAGCTGC CTTGAATGAA ATTCGCCGTA GCCAAAATGC CAATACGATC
AGCCGCATTA TTTTGCTGAC CGACGGTCAA ACCGAGGGCG AACGCGATTG TTTGCGCTTG
GCCGAGGAAA TTGGCAAAGC TAGTGTGCCA TTGACGGCAC TGGGCGTTGG CGACGATTGG
AACGAAGATC TGTTGATCGA AATGGCGAAT CGCTCAGGTG GCGTTGCCGA ATATTTCAGC
AATCCCAACG ATATCGCCTC GTTCTTCCAA GGTGCGGTGC AGCAAGCCCA ATCGGCGGTG
GTGCAAAACT CAGCCTTGAC CTTGCGCTTT GTGCAGGGAG TTGAGCCACG CGCCCTTTGG
CAAGTAACCC CATTAATTCA ACAATTGCCC TATCGGCCAA TTAGCGATCG GGCGGTTGGC
GTGAGCCTCG GCGATATTTC CAAAGACGAA CATCGGATGG TGCTAATCGA AATGCTGGTT
GATCCCAAGC AGGCGGGCCA ATATCGGCTG GGCCAAATCG AGGTCAACTA CGATATTCCT
CAAATGCAGG TAGTTGGCGA AAAAGCTCGC TACGATGTCA TGTTGAATTT TGTGGCTGAT
CCGGCTCAGG CAACCGGAGT TGTGCCCCAA GTGATGAATA TTGTTGAAAA GGTCAGCGCC
CACAAGCTGC AAACTCGGGC CTTAGAAGAT TTGGCCGAGG GCAATATTGG TGCAGCGACC
CAAAAGCTTC AAGGTGCTGT GACCCGCTTG CTCAACCAAG GCGAAACCGA GCTAGCCCAA
ACCATGCAAC AAGAGATCGA AAATCTACAA ACCAATGGGC AAATGACCTC AGCTGGTCAA
AAAACCATCA AATTTGGTAC CCGCAAAACC GTGCGGCTCA GCGATTTGGA TCTACCAAAA
AGTTAG
 
Protein sequence
MAGEVQLTGT LARPALPALQ TQQVVYLLLD ITATPAVAHV QMPVNVSFVL DHSGSMKGDK 
MRCVREATQR ALGLMGPQDI VSVVIFDHRR ETIISAQPVR NVAALQAEVG KIKDAGGTKI
APALEAALNE IRRSQNANTI SRIILLTDGQ TEGERDCLRL AEEIGKASVP LTALGVGDDW
NEDLLIEMAN RSGGVAEYFS NPNDIASFFQ GAVQQAQSAV VQNSALTLRF VQGVEPRALW
QVTPLIQQLP YRPISDRAVG VSLGDISKDE HRMVLIEMLV DPKQAGQYRL GQIEVNYDIP
QMQVVGEKAR YDVMLNFVAD PAQATGVVPQ VMNIVEKVSA HKLQTRALED LAEGNIGAAT
QKLQGAVTRL LNQGETELAQ TMQQEIENLQ TNGQMTSAGQ KTIKFGTRKT VRLSDLDLPK
S
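
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. The sketch below is an illustration under stated assumptions, not the tool that produced this record: it builds the codon table from the standard NCBI ordering (the amino acid assignments of translation table 11 are the same as those of the standard table; alternative start codons are not handled, which does not matter here because the gene starts with ATG), strips the 10-base spacing used in the listing, and also computes a GC fraction. All function names are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to format the listing."""
    return "".join(seq_text.split()).upper()


def translate(seq_text: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq_text)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCAGGTG AAGTTCAATT AACTGGTACG"))  # prints: MAGEVQLTGT
```

Passing the full gene sequence block to `translate` and `gc_fraction` should allow the Protein sequence and GC content fields to be checked in the same way.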