Gene Haur_4211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4211 
Symbol 
ID5736923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5365387 
End bp5366634 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content53% 
IMG OID641281366 
Productvon Willebrand factor type A 
Protein accessionYP_001546971 
Protein GI159900724 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.290242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAC CTGTAGCACT TTCAGCAGTT TGGAGCCGCG AACCTTTGCC AAGCGGCACC 
AGCCAAGTTA ATTATGTCTT GATTCAGGCC AAACCACATC ATGTGCCGAC TGTCCAAGCG
GCTCCGCCAC TCAACTTTTG TTTGGTGCTT GATCGCTCTG GTTCGATGGC TGGCGATAAA
ATTCAACATT TGCGCGAAGC TGTGCGTGAA ATTGTGGCCA ACTTACGTCC AATCGATGCC
GTGAGCATTG TGTTGTTCGA TGATACCTTG GAAGTTCTCG TGCCAGCCCG TTTGGCCGAC
GATCTCCCAG CCTTGCAAAA TGCGATCGAA TCAATCGGCG AGCAAGGTGG CACGGCCATG
TCGTTGGGCT TGCAAGCAGG CCTTGCCGAA TTGCAAAAAT TCCAGGCCGC CGATCGAGTT
GGCCGCGTGC TGCTTTTGAC CGACGGCCAA ACCTGGGGCG ATGAAGATAC CTGCCGCGAT
TTAGCCAAAC AAATTGGCGA TTTAGGCGTT TCGATCACAG CACTGGGCTT GGGCACTGAA
TGGAACGAGG CCTTGCTCGA CGATTTGGCT ACCGCATCCA ACGGCGAATC GGATTATATT
GCCGACCCCA GCCAAATTAG CAAATATTTC CAACAAACCT TGCAAAGCGC CCAAACTACC
ACCGTGGTCA ATGCGCGGTT GCTGTTGCGT TTGCTGCCTG GAGTTACCCC ACGCGCAGTT
TATCGCGTCC AGCCAACGAT CGCCAACCTT GGCTACAAGC CGATTGGTGA ACGCGAAGTC
ACGGTCAGCA TTGGCGAAAT TGCTGGCGAT GGAGCCAGTG TTTTGGTCGA TGTGATGCTG
CCAGAGCGTG AAGCGGGTAC GTTCCGCATC GCCCAAGCTG AATTGCAATA CGATGCCCCA
GTGCTTGGTA TCAAAGAAGG CAAAATTAAA ATTGACATTC CTTTGAGCTT TAACGTCGAT
CCCAAGGCCA GCGTGGTCAA TCCGCCAATT ATGAACACGG TCGAAAAAGT GACCGCCTTC
AAATTGCAAA CACGGGCACT TTCCGAGGCC GAGGCCGGCA ATATTGGCAG CGCAACCCAA
AAATTACGCG CCGCCGCCAC CCGTTTGCTC GATTTAGGCG AAACTGAGTT GGCCCAAACC
ATGGAACAAA GTGCCCAACA ACTTGAGGCT GGTGGTCAAA TCGCAGCCGC TGATCAAAAA
GCCCTGCGCT ACGCCACCCG CAAACTAACC CAAAAATTAG AAGAGTAA
 
Protein sequence
MTEPVALSAV WSREPLPSGT SQVNYVLIQA KPHHVPTVQA APPLNFCLVL DRSGSMAGDK 
IQHLREAVRE IVANLRPIDA VSIVLFDDTL EVLVPARLAD DLPALQNAIE SIGEQGGTAM
SLGLQAGLAE LQKFQAADRV GRVLLLTDGQ TWGDEDTCRD LAKQIGDLGV SITALGLGTE
WNEALLDDLA TASNGESDYI ADPSQISKYF QQTLQSAQTT TVVNARLLLR LLPGVTPRAV
YRVQPTIANL GYKPIGEREV TVSIGEIAGD GASVLVDVML PEREAGTFRI AQAELQYDAP
VLGIKEGKIK IDIPLSFNVD PKASVVNPPI MNTVEKVTAF KLQTRALSEA EAGNIGSATQ
KLRAAATRLL DLGETELAQT MEQSAQQLEA GGQIAAADQK ALRYATRKLT QKLEE