Gene Haur_3942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3942 
Symbol 
ID5735803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4940118 
End bp4942613 
Gene Length2496 bp 
Protein Length831 aa 
Translation table11 
GC content49% 
IMG OID641281093 
Productvon Willebrand factor type A 
Protein accessionYP_001546704 
Protein GI159900457 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00244396 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTTAA GTCGCATGAC GCGACGTGTA GCCCAACCAG TCGCAGGACA ATCATTAGTA 
TTCTCTGCTA TTTTGTTCTT CGTCATGATT GCGTTTGCTG CACTAGCCAT TGATACAGGC
GAGGCGTTTA GCCGCCAACG CCAGCAACAA GCAGCTTCAA CTGCTGCTTC AATTGCCGGT
TTGGAATCCA TGAATAGCGA AATCGACGGA ACTGATGGCG CTGTTCAACA GGCTATCCGC
GATGCTTTGG CTGCCAACGG CATTACCAAT GCCGTTTATA TCAACGGCGA TTGGGGCAAC
CTTGACCCAA GCCAAAACTA CTATCGGGCT TTCTATACCC AACGTGGCTC ACGGGTTGAA
TATCCGGTTG GCAGCGGTGG TCAAGTTTCG ACCGAGTTCA ATGGCTTGCG GGTTGAAGTT
CGTTCAGCAC GGAACACCAT CTTTGGCCAA GCCTTAGGTA TCGATACGCT CGAGGTTTCG
GCTGAAAATA AAGCAACCTT GTGTCAATGT GCTACCAACA TCTTCCCGCT GGCGGTTGCG
ACCCAAAAAA TGACGGGAAT GTTGCCAGGT GAAAGCAAAC CGATTGCTTG GACGAAAAAT
AAAGTTGGCT CATCGGTCGA AAAAGACTAC TTCGTCTGGG CCGAATGGCA AACCTTGTCG
GGCTTCAACG CCGATAATCG TTTGAAGGAA TCGTTGGGTG GTACTGGTGA TGTGATTAAA
GGGGTCACCG AGGCTACCGC ACCTCCAGGC TATGCCAACG CTAGCAATAA CGGCGTGATC
AACATCGACG ACTGGATCAA GGTTTCTGAT AGTACGTTGC GCCCAAATGC AGCAAGTTTG
GCTACTGCCT TGACGGCCTT GAAGAACAAG CCAATCCTGA TTCCAACCTT TACCAAGTTG
AGCTATTCGG GTACAACTGG CAACGAAAAA TATACCAGTT TCCGCTCAGG TGGCTTTGTC
AAGGTCATGG TGACCAATTT TGACAGTACT GGGATTACCA TCCAATATCT CAACAGCAAC
TATACCTGCC CATGTGTTGA TGTACCACAA CCACCACCAC CACCAGAAGT CAAGCTTGCC
CTTGACCAAA AGTTGGTTTG GTATTTGCCA AGTATCAATA CCACGAGCTA CGATATTTCG
TTAGTGGTTG ACATTTCAGG CTCGATGCAA TGGTGCTACG ATAGTCAACG GACTTGTAGC
GTCGATGCCA ATGCCCGCTG GTATCGGGTT AAGGACTTCT TAGCCAAGTT CTCATATAAG
ATGCTTGATG TTTGGAATGC TCCTGCTGGC CAAAATATGA ACAACGCTAG CTTGTTCCCT
GGAGAAGCAT TGGTTGGTAA AGGTGGCGAT AACCGGATTG CTGCCGTGCG CTTCAGTGGT
AATGCAGTTA CTAGCAGCCC ATCGTTTGGT TTTGTAACCA GCCCAGCTGG TAGCGACCAA
GCCAGCGTGA GTGCTCGTAC CACCACGATG CGCTCTAACA TGAATTCCTT GATCAGCTGG
ATTACTAAAG CCAATATGAG CGGTAGTACC TCAGGTGGTC GCGGTTTACG CGAAGGCATT
CGCTACTTTG ATAATGTTAG CGCTCATACC CGGGTTGATC GTTTTGGTCG CCCAATCAAA
TTGGTAATGG TGATGTTGAC CGACGGTTTG ACCAACGTGA TGTACGAAGG ACCTCAAGTT
AACTCACAAA ATAGCCAAAA ATTGAAACAT ACCCGTTCTG GTGGCTATAA ATATTGTAAA
GATCCTGCTG ATCCAACGGC TAATAACGTT TTGACCGTAG GTGGCGATAA TTATCCGATC
ACCGACTTGC CTGAAGTTCA AGCCAACTGT CCATGGAATG GTCAAGGGGC AGGTAATGGT
TATGCCAAGG CTCCAATTGT ATCGCTGGTT GAGGTTGCGA CCCAAGCCCG TAATCGGGTC
TCGCCGCAGC GACCAGTCAA CATCTATGCA ATCTTGGTAG GTGATCAAGG TCGATACGAT
TTACAAGACT TGCGGATTGA CCAAATTGCT TCACCTGGTG GCGCGTTCTA TGCGCAAAAC
CCAAATGCCT TGGACTATGC ACTCGATGCG ATTCTGGATG ACCTAGCCCA GCCATGTTAC
GAGCGCGATG CCACGATCAT TGCGGCAGGT GCTAAAGTTA CGGTCTACGA TAGCTCGAAT
AACCCAGTTG CTGGGATGAG CAACATGTCG GCAGACACGA ATGGTACCTT ACTCTTTACC
GTACCCGAGG CAGGCAACTA TAGTTTTGGG GCAACCCGTA GTGTAACAAG CTGGAGCGAA
TTCCCATTAG GTAGTGGTGA AACGATTAAT CCAAGCTTGT ACCCTGCCAA CTACTTGCCA
CAACTGTATA ATCGGTTACG CGGTGCTGAA GATACGATTG TTCAAAATCG CATCCAATTC
AATATCCCCG CAGAGGCTGA TAGCTTGATT GATCTTGGTA AGTACACCAT GATCATTGCC
GAAGCCCAAC AAAACAAAGC ACTCTGTCCA GAATAG
 
Protein sequence
MNLSRMTRRV AQPVAGQSLV FSAILFFVMI AFAALAIDTG EAFSRQRQQQ AASTAASIAG 
LESMNSEIDG TDGAVQQAIR DALAANGITN AVYINGDWGN LDPSQNYYRA FYTQRGSRVE
YPVGSGGQVS TEFNGLRVEV RSARNTIFGQ ALGIDTLEVS AENKATLCQC ATNIFPLAVA
TQKMTGMLPG ESKPIAWTKN KVGSSVEKDY FVWAEWQTLS GFNADNRLKE SLGGTGDVIK
GVTEATAPPG YANASNNGVI NIDDWIKVSD STLRPNAASL ATALTALKNK PILIPTFTKL
SYSGTTGNEK YTSFRSGGFV KVMVTNFDST GITIQYLNSN YTCPCVDVPQ PPPPPEVKLA
LDQKLVWYLP SINTTSYDIS LVVDISGSMQ WCYDSQRTCS VDANARWYRV KDFLAKFSYK
MLDVWNAPAG QNMNNASLFP GEALVGKGGD NRIAAVRFSG NAVTSSPSFG FVTSPAGSDQ
ASVSARTTTM RSNMNSLISW ITKANMSGST SGGRGLREGI RYFDNVSAHT RVDRFGRPIK
LVMVMLTDGL TNVMYEGPQV NSQNSQKLKH TRSGGYKYCK DPADPTANNV LTVGGDNYPI
TDLPEVQANC PWNGQGAGNG YAKAPIVSLV EVATQARNRV SPQRPVNIYA ILVGDQGRYD
LQDLRIDQIA SPGGAFYAQN PNALDYALDA ILDDLAQPCY ERDATIIAAG AKVTVYDSSN
NPVAGMSNMS ADTNGTLLFT VPEAGNYSFG ATRSVTSWSE FPLGSGETIN PSLYPANYLP
QLYNRLRGAE DTIVQNRIQF NIPAEADSLI DLGKYTMIIA EAQQNKALCP E