Gene Haur_0691 details


Gene Information

Locus tag: Haur_0691
Symbol:
ID: 5732592
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 794783
End bp: 796522
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 54%
IMG OID: 641277821
Product: von Willebrand factor type A
Protein accession: YP_001543467
Protein GI: 159897220
COG category:
COG ID:
TIGRFAM ID: [TIGR02226] N-terminal double-transmembrane domain
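
The start, end, gene length, and protein length fields above are related by simple arithmetic, sketched below. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length_bp(794783, 796522)
print(length)                           # 1740
print(expected_protein_length(length))  # 579
```

Both values agree with the Gene Length (1740 bp) and Protein Length (579 aa) fields of this record.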


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTTGC TTGTGCCATT GGGTTTAATT GGCTTAATCG CCTTGCCGAT CATCGTGGTG 
TTGCATATGG TGCGCCAGCG CCGCCAACGG CTGAGGATTC CGACGATTCG GCTGTGGGCG
GCATTAACGC CACCACCTGA GCGCCAACAA CGCAAATTGC CCCTAACCTT GCTGTTGGCT
TTGCATTTGC TGGTTGCTGC TTGTTTAGCC TTGAGTTTAG CCCAACCAGC TTGGATTTTT
GGAGCCGCTG CACCACGCCA CCTCGTAATT ATTCTTGATA CCACCTCTAG TATGGCGGCC
AATCGCTCAT TTACTCAAGC TCAACAACAA ACCGAGGATT TGATTAACGA TTTGGGGCGT
GATGATAGCC TGGCGCTGGT TGAGCTGAAT CACGAAGCGC GTTTGCTTGG CTATGGCGGC
TATGCTGAAC GCCAACAATT GCGCCAAATT GTGGCCGAGC TGGCTCCCGC TGGTAATAAT
GCCAACTTGG CTCAATCGCT GAGCATTGCC AATGCCACCC TCGCCAACGA TCGCCAAAAC
CAACTGATTG TACTAAGCGA TGGCGCATTG CCCGCCAACA GCACGCCGTT ATCGGTTGCC
GCCGAATTAG AATGGCGGAT GTTGGGCGAA AGCACCGCCA ACAGTGGCAT TGTCAATTTT
GCGAGCCGCC GTTTGCCCAA CCAGCGCAAC GCCCTGTATG CCCGTGTGAC CAATTTTAGC
GATCTGCCTG CGGCCCGAAC CTTGACTCTT TTGGTTGATG GTGAAATTGA ATCGGAGCAA
AATTTGGTCA TTCAGCCTGG TGGCAGCGAG GAGCGCACGT GGGAAGTAGC CAATGGCGAG
TTGGCTGAAT TACAACTTAG CCCTAACGAT GGTTATGCGC TTGATGATCG GGCGGTGCTT
GCGCTCAGTC GCTCTGGCTC ATTGCGAGTT CATTTGGCCA CATTAACTCC CTCGCCACTT
GAACGAATGC TGCGCAGTTT GCCCAATATT GAGCTAAGCG TTGGCCCAAG CGTCAGCAAT
CAACGGGTGG ATTTGACTGT GTTGAATGGA GTTTTACCGC AGCAATTGCC AACCAGCGCC
TTGTTGATTG TCAATCCGCC GAGCGACCCA CGCTTGCCAA CCCAAGATAG TGTGCTAGGT
GAGCAGGCTA GCAGCGCGGT CTTGGATGCC GATTTTGCTG GCATCGACCT TTCAAGTGTG
CAATGGGGTG GTCGTCGCCC GATCAAGCGT GAAGATATTC CCGCAGGCTT GAGCAGTGTG
ATCGAAACTG ATACGCAAGC GCCCTTGGTG CTGCGTGGCA CATGGCAAGA GCGAGCAACC
ATCGTTTGGC TGTTTAATTT AGATAATGCG AACCTTAGCG CAAAATTGGC ATTTCCTTTG
TTGACAGCGG CCAGCATCGC CAACTTAACG GGTGGATCAT TGCCTGAGCA ACTGGCGGCG
GGCAGTTTTG CCCCCAATAC GCCGCTAACC CGCCCTGATA GCGAGGCCCA AGCGCTTGAT
CAGCGCCTGA ATCAAGCAGG TTTGTATCGG GTCGTTGGGA GTAATCGTGG CGGGATTGCG
GTCAACTTTG GCGATCCACA GGAGTCAAAT CTGCAACAAC AAACCCAGCC AACGATTAGC
CAAAGCCCGC AACCTGAGGG TGATCGCTTG CCGCCCCAAG GTACGCCATT ATGGCCGATG
CTGGTTGGTT TGGCCTTGGT CGGATTGATT TTTGAATGGT GGTATAGCTT TCGATCGTAG
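
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any calculation. The sketch below shows one way to do that and to compute a GC percentage that can be compared with the GC content field of the record (54%). The helper names are hypothetical, and the example runs only on the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


# First two 10-base blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGAATTTGC TTGTGCCATT")
print(fragment)              # ATGAATTTGCTTGTGCCATT
print(gc_percent(fragment))  # 35.0
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives the value to compare against the reported GC content.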
 
Protein sequence
MNLLVPLGLI GLIALPIIVV LHMVRQRRQR LRIPTIRLWA ALTPPPERQQ RKLPLTLLLA 
LHLLVAACLA LSLAQPAWIF GAAAPRHLVI ILDTTSSMAA NRSFTQAQQQ TEDLINDLGR
DDSLALVELN HEARLLGYGG YAERQQLRQI VAELAPAGNN ANLAQSLSIA NATLANDRQN
QLIVLSDGAL PANSTPLSVA AELEWRMLGE STANSGIVNF ASRRLPNQRN ALYARVTNFS
DLPAARTLTL LVDGEIESEQ NLVIQPGGSE ERTWEVANGE LAELQLSPND GYALDDRAVL
ALSRSGSLRV HLATLTPSPL ERMLRSLPNI ELSVGPSVSN QRVDLTVLNG VLPQQLPTSA
LLIVNPPSDP RLPTQDSVLG EQASSAVLDA DFAGIDLSSV QWGGRRPIKR EDIPAGLSSV
IETDTQAPLV LRGTWQERAT IVWLFNLDNA NLSAKLAFPL LTAASIANLT GGSLPEQLAA
GSFAPNTPLT RPDSEAQALD QRLNQAGLYR VVGSNRGGIA VNFGDPQESN LQQQTQPTIS
QSPQPEGDRL PPQGTPLWPM LVGLALVGLI FEWWYSFRS
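
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal pure-Python illustration, not the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene starts with ATG, no special handling of alternative start codons is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAATTTGC TTGTGCCATT GGGTTTAATT"))  # MNLLVPLGLI
print(translate("TTTCGATCGTAG"))                      # FRS
```

The two outputs match the first ten residues and the last three residues of the protein sequence listed above.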