Gene Haur_0249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0249 
Symbol 
ID5732144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp289880 
End bp292732 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content53% 
IMG OID641277373 
Productvon Willebrand factor type A 
Protein accessionYP_001543029 
Protein GI159896782 
COG category[S] Function unknown 
COG ID[COG5426] Uncharacterized membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTATCGT TCGTCCGACC AGAATATCTC TGGTTTTTGC TGGCGCTGCC ACTGGTCTGG 
CTGTTGGGCT GGCTCAATAA CCGTGGTCGC ACAGGCCAAC GCCGTTGGTG GGCCTTGGGC
TTACGCACGC TATTGCTGCT GTGTTTGATC GGCAGCCTTG CCGGAACCCA AGTACGCCAA
CCAGTTCAAA ACCTAACCAC GGTATTTTTG CTCGATAGCT CCGACTCAAT TGCCCCGGGC
CAACGCTCTA ACAACGAGCA ATTTATTGCC CAAGCGCTCG AAACCATGCA AGAAGGCGAT
AAAGCTGCCG TCGTGGTGTT TGGCGAAAAC GCTTTGGTTG AGCGGGTTCC CTCTGAAATT
CAGCGCTTAG GCACAATTCA ATCGGTGCCA ATTGCTGCGC GTACCGATAT TAGCGAAGCA
ATTCAGCTTG GTTTGGCGCT GTTTCCCGCC GATACCCAAA AACGTTTGGT GTTGCTCTCT
GATGGTGGCG AGAACAGTGG CCGCGCCTTG GAGATGATCC CACTGGCGCA ACGCCGCAAT
GTGCCAATTG ATATTGTGCC AACAGGCATT GGCCAAGGCA ACCCCGAAGT GGCAATTAGT
GCCTTTCGGG CACCATCAGC CGCCCGTTCA GGCCAAGAAA TTCAGTTGAT CGCCACGATT
GAAAGTAATA CCGCCCAATC AGCCCAATTG CGCTGGCGAG CCGATGAGCA AATTGTACTG
GAAGAAGCGA TCAATCTACC AGTTGGCACG AGTAGCTTTA CCACAACGCT CGTTGTCAAC
GATCAAGGCT TCCATCGCTA TAGTGCCCAA GTTGTGCCAA CCAGCGATAC GCGTGCCCAA
AATAATGTGG CGGCTTCGTT GGTGCAAATT GGTGGCCCGC CCAAGGTGCT GCTGGTTGAA
GGCGAAGTCG GCGATGCAAG TGCGCTCAAG CCAGCACTCG AAGCTGCCAA CCTTGTACCA
GTCGTTGTTC CAGCCACTGG CTTGCCCACC GATTTAGCAG CATTGAGCGA TTATGAAGCA
GTCTTGTTGC TGAATGTGCC GTCGCGCGAT ATCGATCAAG ATACCCAAAA ATTATTACGC
TCGTATGTTG GCGACCTTGG GCGTGGCTTG GCAATGATCG GCGGTCGCCA AAGTTTTGGC
GTGGGTGGCT ACACGGCCAC CCCCATCGAA GAAGCCTTGC CCGTCAATAT GGATGTGCGC
AATCGCCAAC AACGCCCTGA TATTGCTTTG GTGTTTATCA TCGACAAATC GGGCAGTATG
GATGCTTGTC ACTGTAACGG TGGCGATATG GCGGCGCGTG AAGGTGGTGG CACGCGCAAA
ATCGATATTG CCAAAGAAGC GGTGGCTCAA GCCGCTGCGG TGCTGGGCAA AGACGATAAA
TTGGGTGTCG TGACCTTTGA TGATTCGGCG CATTGGACGA TTGAACTCGA TAAAGTGCCC
AGCCAAGATG ATGTTGTCGC GGCTTTGGCT CCTGTGCCAC CAAGCGGCCA AACTAACGTG
GTTAGTGGCA TGAACGCTGC CTATGAGCAA TTGCGCCAGA GCGATGCTAA AATCAAACAT
GCGATTTTGC TGACCGATGG TTGGGGCCAT GCTACCGATA TCGGATCAAT CGCCGAAAAT
ATGAACAAAG ATGGCATTAC GCTCTCGGTG GTTGCAGCAG GTAATGGCTC GGATAACGCT
TTGCAACGCT ATGCTGAGCT GGGTGGTGGA CGTTATTATC CAGCCCGCGT GATGGAAGAA
GTGCCGCAAA TCTTCTTGCA AGAAACGATT CAGGCGGTTG GCACTTATAT CGTTGAAGAA
CAATTTACCC CGGCTTATGC TGGCGATAGC CCGGTGCTGG CCGATTTGCA AGAAGGCTTG
CCAAGCTTGT TGGGCTATAA CGGCACAGTC GAAAAAGATA ACGCTCAAGT TATTTTGACT
GCCAGCGATG GCTCTCCCAT TTTGGCCCAA TGGCAATATG GGCTTGGCCG GAGCATCGCT
TGGACGAGCG ATCTCAAGGG CAAATGGGCC TCAAACTGGG TCACATGGGA AGAATTTCCA
CGCTTCACGG CGCAGTTGGT TGGTTGGCTT TTGCCACGTA TCAGCAACGA TAATGTCAGT
GGTGAGGCCT CGTTAATTGG CAGCGACGTG CAAATTGATA TTGTTGCCAA CGACGAGAAG
GGCAATCCAC AAACTGCGAT GAACGTCAAT GCTCGTTTGA TCGGGCCAAC TGGCGAGGCG
ATTGATGCAA CCTTGGCTGA AGTTGGGCCT GGCCAATATC GCGCACGGGT GGCTAGCCCA
ATTGCTGGCA CCTATTTGAT TCAGGTGATC GGCAATGATG CAAACGGCAA GCCAGCCTTT
GCCCGCACCT TGGGCTTGAT TGTTCCCTAC TCGCCAGAAT ATCGCCAAGG CCAATCTAAC
CCTGAATTGC TGAGCACTTT GGCCAAAGCC ACTGCGGGCC GCAGCTTGAG CCAACCAATG
CAAGCGTTTG ATCATACGCT GGATGCAGTG CGCCGCGCTA CGCCTATTGA TTTGGGCTTG
TTGTTTGCAG CTTTGGTGTT GCTGTTGCTT GATATCGCAA TTCGCCGCCT CAACTTGCGC
CGCAAAGATT TTGCCGCCTT GCAAGCAGCT CGCAAAGAGC GCCAAACGAT TGCTGCCGCC
CCAACTGCCA CAATGAACAG TTTGCAGGGA GCCAAGGGGC GTGCCCGCCA GCAAATGTTC
AGCGATAAGA GCGAGCGCGA AGTTAAGCCC AAAGAAAACC CAGCAACTAC GCCATTACCA
AGTACACCAA ACAATCCAAC CAAAGCCGTT GATGAAGCCG AAGATCCACT CGAACGGCTC
CGCGCCGCCA AAAATCGTGC CCGCAGGCAA TAA
 
Protein sequence
MLSFVRPEYL WFLLALPLVW LLGWLNNRGR TGQRRWWALG LRTLLLLCLI GSLAGTQVRQ 
PVQNLTTVFL LDSSDSIAPG QRSNNEQFIA QALETMQEGD KAAVVVFGEN ALVERVPSEI
QRLGTIQSVP IAARTDISEA IQLGLALFPA DTQKRLVLLS DGGENSGRAL EMIPLAQRRN
VPIDIVPTGI GQGNPEVAIS AFRAPSAARS GQEIQLIATI ESNTAQSAQL RWRADEQIVL
EEAINLPVGT SSFTTTLVVN DQGFHRYSAQ VVPTSDTRAQ NNVAASLVQI GGPPKVLLVE
GEVGDASALK PALEAANLVP VVVPATGLPT DLAALSDYEA VLLLNVPSRD IDQDTQKLLR
SYVGDLGRGL AMIGGRQSFG VGGYTATPIE EALPVNMDVR NRQQRPDIAL VFIIDKSGSM
DACHCNGGDM AAREGGGTRK IDIAKEAVAQ AAAVLGKDDK LGVVTFDDSA HWTIELDKVP
SQDDVVAALA PVPPSGQTNV VSGMNAAYEQ LRQSDAKIKH AILLTDGWGH ATDIGSIAEN
MNKDGITLSV VAAGNGSDNA LQRYAELGGG RYYPARVMEE VPQIFLQETI QAVGTYIVEE
QFTPAYAGDS PVLADLQEGL PSLLGYNGTV EKDNAQVILT ASDGSPILAQ WQYGLGRSIA
WTSDLKGKWA SNWVTWEEFP RFTAQLVGWL LPRISNDNVS GEASLIGSDV QIDIVANDEK
GNPQTAMNVN ARLIGPTGEA IDATLAEVGP GQYRARVASP IAGTYLIQVI GNDANGKPAF
ARTLGLIVPY SPEYRQGQSN PELLSTLAKA TAGRSLSQPM QAFDHTLDAV RRATPIDLGL
LFAALVLLLL DIAIRRLNLR RKDFAALQAA RKERQTIAAA PTATMNSLQG AKGRARQQMF
SDKSEREVKP KENPATTPLP STPNNPTKAV DEAEDPLERL RAAKNRARRQ