Gene Haur_2141 details

Gene Information

Locus tag: Haur_2141
Symbol:
ID: 5734043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2695539
End bp: 2697371
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 50%
IMG OID: 641279282
Product: von Willebrand factor type A
Protein accession: YP_001544909
Protein GI: 159898662
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
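
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2695539
end_bp = 2697371

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)
```

Under those assumptions the result agrees with the listed gene length of 1833 bp and protein length of 610 aa.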


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGTCAA TTCGCAAATC GAAGTTCGAA GAGCAGAGTA GAAACCTACA GATACAAAAG 
GGAACCAAAG AGGAGCAAGC AACGTTTGAT CGGTGTGGTC CACAATGCCT TATCAATTTA
ACCAAAAAAG AGGATGTTAT GCGACTAAAA CGAAGCAGCA TTGTGCTGAT CGCATTGATC
ATCAGCGCTT GTGGGGGAGA AGCGAGCCTA CCAACGATCA ATCCGCAACC ACAACGGCCA
GCGCCGCAGC CACGGCCAAC CAGCGCCGCC GACGATCAAT CAGCGCAATG GCCGACTGCC
GAAGCAACGA GCGTAGCGCC AGCGCCACAA CCAATGCCGA CTCAAGCAGC AGATGCCGGT
CAGCCAGTGC CAAATCCTGC TGCTGGTAAA CCCTTGGTTG ATACGTGGGA GCTGCCAACC
CAACCGATCG ATCCAAATCC AAATTACGCC TACGAACAAG ATCAAGAAAT CTTTGATTCG
ATGTATTTTA AAAATTATGG CACAAATCCA TTCGTGCGGA CAGAAACCGA CCCCTTATCA
ACCTTTGCGA TGGATATTGA CAGTGCTTCG TACAGCCTGA TGCGCAGTAG CATCAACCAA
GGCCTCTTAC CGCCAGCCGA TTCAGTGCGA GTCGAAGAAT ATCTGAACGC CTTTGATTAC
GAGTATCCCC AGCCCGAAGA TGGCGATTTT GCGATCTACA GCGAAGTAGC GCCATCGCCA
TTTGGCGGCC CCAACTACGA GCTAGTGCAA ATTGGCATTC AAGCTCGAAG TATCGAAGTA
GCTGATCGCA AGCCTGCCGC CCTTACCTTT GTGATCGATA CATCGGGATC GATGGCCCAA
GATAATCGCT TGGAAATGGT CAAAAATGCC CTGATTTATT TGGCTGGGCA ACTTGAGCCT
GACGATAGTT TGGCAATTGT GGCCTTTAAC GATGGAATGC GAGTGGTGTT AAACCCAACT
TCGGGCGAAA ATCAGATGGA TATCATCACC GCAATCAATT CACTTGAGCC AGCTGGCAGC
ACCAACGCCG AAGCTGGACT TTATAAAGGC TTTGAATTAG CCTGGCAAGC CTTCAAACCG
GAAGGCATCA ACCGGATTTT GCTCTGCTCA GATGGCGTGG CTAACAGCGG CATGACCGAA
CCAAGTCAAC TGCTCGCGAC CTTCCAACAA TATCTTGATG CAGGCGTTCA GCTTTCGACC
TATGGCGTGG GTATGGGCAA CTACAACGAC ATTTTGTTAG AGCAACTGGC CGACAAAGGC
GATGGCAATT ATGCCTATTT CGATTCAGCC GATGAAGCCC AACGCCTGTT TGGCGAGCAA
TTGACTGGTT CGCTGCAAAC CATCGGGCGC GAAGCCAAAA TCCAAGTTAA TTTTGACCCA
AATGTAGTGA AACGGTATCG CTTGATTGGC TATGAAAATC GTGCGGTAGC CGATAGCGAC
TTCCGCAACG ACAGTGTTGA TGGTGGCGAA GTTGGCGCGG GCCATAGTGT GACAGCGCTG
TATGAAATCA AGCGCCATCC TGATGCCCAA GGCCCAATCG CCCAAGTTAA TATTCGCTAT
ATCAGCATGG ATACTAACGC ACCAGTTGAA GAAAGCCTGA ATATTTCAAC GGCGCAAATT
CATAGCAGTT TTGATCGCGC CAGTGCGCGA ATGCACCTAG CAACGAGCGT CGCCGAATAC
GCCGAACTAT TACGCCATTC ACGCTGGAAT AACGGCACTG ATATCCTTGA TGTGCTTGAT
CTGGCTGAAG AAGCGGCGCT AGATTTACCC AATAATCAAA GTGCCGTTGA ATTTGTTACC
CTGCTACGGC GGGCTGAGCA GATGCACCAA TAA
 
Protein sequence
MRSIRKSKFE EQSRNLQIQK GTKEEQATFD RCGPQCLINL TKKEDVMRLK RSSIVLIALI 
ISACGGEASL PTINPQPQRP APQPRPTSAA DDQSAQWPTA EATSVAPAPQ PMPTQAADAG
QPVPNPAAGK PLVDTWELPT QPIDPNPNYA YEQDQEIFDS MYFKNYGTNP FVRTETDPLS
TFAMDIDSAS YSLMRSSINQ GLLPPADSVR VEEYLNAFDY EYPQPEDGDF AIYSEVAPSP
FGGPNYELVQ IGIQARSIEV ADRKPAALTF VIDTSGSMAQ DNRLEMVKNA LIYLAGQLEP
DDSLAIVAFN DGMRVVLNPT SGENQMDIIT AINSLEPAGS TNAEAGLYKG FELAWQAFKP
EGINRILLCS DGVANSGMTE PSQLLATFQQ YLDAGVQLST YGVGMGNYND ILLEQLADKG
DGNYAYFDSA DEAQRLFGEQ LTGSLQTIGR EAKIQVNFDP NVVKRYRLIG YENRAVADSD
FRNDSVDGGE VGAGHSVTAL YEIKRHPDAQ GPIAQVNIRY ISMDTNAPVE ESLNISTAQI
HSSFDRASAR MHLATSVAEY AELLRHSRWN NGTDILDVLD LAEEAALDLP NNQSAVEFVT
LLRRAEQMHQ
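
As a rough consistency check between the two sequences, the gene sequence can be translated codon by codon and compared with the protein sequence. The sketch below is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which codons may act as start codons), and it translates only the first and last rows of the gene sequence as printed here. The names `translate`, `CODON_TABLE`, `first_row`, and `last_row` are hypothetical.

```python
# Compact encoding of the standard codon assignments (also used by table 11),
# with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 33 bases of the gene sequence, copied from the record.
first_row = translate("ATGCGGTCAA TTCGCAAATC GAAGTTCGAA")
last_row = translate("CTGCTACGGC GGGCTGAGCA GATGCACCAA TAA")

print(first_row)  # MRSIRKSKFE
print(last_row)   # LLRRAEQMHQ
```

Both fragments match the start (MRSIRKSKFE) and end (LLRRAEQMHQ) of the protein sequence above; passing the full gene sequence to the same function would extend the comparison to all 610 residues.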