Gene Haur_1116 details

Gene Information

Locus tag: Haur_1116
Symbol:
ID: 5733008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1278362
End bp: 1280050
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 49%
IMG OID: 641278255
Product: von Willebrand factor type A
Protein accession: YP_001543892
Protein GI: 159897645
COG category:
COG ID:
TIGRFAM ID:
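
As a quick consistency check on the fields above, the reported lengths can be recomputed from the coordinates. This is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (the gene sequence listed on this page ends in TAG). The function names are illustrative only and do not belong to any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1278362
end_bp = 1280050
reported_gene_length = 1689     # bp
reported_protein_length = 562   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon encodes one residue.
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # prints "1689 562"
```

Both computed values match the reported Gene Length and Protein Length fields.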


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTTTTC GATTTATACT ACGCTTATTA TTGCTCTTCA TATTTTTGAG CGCCTGTGGT 
CAGGCGGCTA ACCCAATGCA ACCAAGCCAA TCCGGCCAGG CGAGCAGTGA TGATCTGGTG
TTGCGGTTGC TCTATGGCAG CGAAAAGCAA CTCTGGATTG ATGCGGTAGT GAGCGATTTT
AACGCCAGCT CAGCTAAAAC CGCTAGCGGC CAACGCATTC AGGTCGAAGC AGTGCCCGTT
GGCTCGCTCG AAACGATCAA CGGCTTACTT GATGGCTCGC AACAGGCCGA TTTGTGGAGC
CCAGCGAGCA GTTTATCCTT GCCATTAGCC AATCAACGTT GGAAGACGGC TAAAGGCGAA
TTGCTGTTTA GCGATCAAAC TCCCACCTCG TTGGTGCTTA GCCCAGTTGT GATTGGCATG
TGGAAGCCGA TGGCTCAAGC CTTGGGCTAC CCCGATAAGC AAATTGGCTG GGCGGATTTG
GCCGATTTGG CGACCAGTGG CAAAACTTGG GCCGATTTTG GTCATCCAGA GTGGGGTGCA
TTTCAGTTTG GCCACACGCA CCCTGAGTTC TCCAATAGTG GGCTTGCCAC GATTGTGGCG
ATGGCCTATG CTGCCAATCA AAAAACCAGC GATCTGACTG TCGCTGATTT GGATAAACCC
GAAACTGCCA GTTTGATCAA TGCTGTCGAG CAATCAGTGA TTCACTATGG CTCAAGCACA
GGCTTTTTTG CCAAAACCAT GTATGAGCAT GGCCCTTCGT ATTTGTCGGC GGCAATTTTG
TATGAAAATC TCATCATCGA GTCGTATGAT CAAGCGTTGT ATCCCAACCT TGAGTTGCCG
ATGGTGGCGA TTTACCCCAA GGAAGGCTCA TTTTGGAGCG ACCATCCCTT GGTGGTGCTG
GAAACCGAGC GCATGAATGC CGACAAACGG GCTGCAGCGC AAGTATTTCA AGAGTTTTTG
CTGGCTCAGC CTCAACAAGC CAAGGCCATG CAATATGGTT TTCGGCCAGC CAATGTTGAT
ATTAGCCTCG CTGCGCCAAT TGATACGGCG CATGGCGTTG ACCCAAGCCA ATTGCAAGTC
GCCTTGCCAA CGCCTTCGGC AGAGGTTTTG CAGGCCATAA CTCAATTGTG GCAGCAGCAC
AAAAAGCAAG TTGATGTAGC GTTGATTATT GATACTTCTG GCTCAATGCG TCAAGAAAAC
CGTTTGCGCG AAGCCAAAAC GGCGCTTGGC GATTTTATCG ATATCTTTGC CGATCAAGAT
AATGTGCAAG TGACGATTTT TAGCACCAAT GCAACCGAGC TTTCCGATCT CTCGCCGATT
GGCCCCAAAC GGGCCGATTT GCATACTCGC ATCGATGGAT TGGTGGCCGA TGGCGAAACT
CGTTTGTACA GCACAATTGG CGAAGTCTAT ACCGATATTC AGCAACAAAC TGAAGTGCAG
CGGATTCGCG CATTGGTGGT GTTGACTGAT GGCGAAGATA CGGCTAGCTC ATTGAGTTTA
GAGCAATTGA ATGAACAAAT TCGCCAAGAT GAATCTGGCA CGTCGATTAA AATTTTCACG
ATTGCCTATG GCTCTGATGC CAATCAAGAG GTTTTGCAAC GAATTGCCGA AATCACTGGA
GCCAAATCAT ATACTGGCGA TCCGGCGACA ATTCGTCAGG TTTATCATGA AATTGCTACA
TTTTTCTAG
 
Protein sequence
MRFRFILRLL LLFIFLSACG QAANPMQPSQ SGQASSDDLV LRLLYGSEKQ LWIDAVVSDF 
NASSAKTASG QRIQVEAVPV GSLETINGLL DGSQQADLWS PASSLSLPLA NQRWKTAKGE
LLFSDQTPTS LVLSPVVIGM WKPMAQALGY PDKQIGWADL ADLATSGKTW ADFGHPEWGA
FQFGHTHPEF SNSGLATIVA MAYAANQKTS DLTVADLDKP ETASLINAVE QSVIHYGSST
GFFAKTMYEH GPSYLSAAIL YENLIIESYD QALYPNLELP MVAIYPKEGS FWSDHPLVVL
ETERMNADKR AAAQVFQEFL LAQPQQAKAM QYGFRPANVD ISLAAPIDTA HGVDPSQLQV
ALPTPSAEVL QAITQLWQQH KKQVDVALII DTSGSMRQEN RLREAKTALG DFIDIFADQD
NVQVTIFSTN ATELSDLSPI GPKRADLHTR IDGLVADGET RLYSTIGEVY TDIQQQTEVQ
RIRALVVLTD GEDTASSLSL EQLNEQIRQD ESGTSIKIFT IAYGSDANQE VLQRIAEITG
AKSYTGDPAT IRQVYHEIAT FF