Gene Haur_1973 details


Gene Information

Locus tag: Haur_1973
Symbol:
ID: 5733862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2421714
End bp: 2424518
Gene Length: 2805 bp
Protein Length: 934 aa
Translation table: 11
GC content: 50%
IMG OID: 641279117
Product: tail collar domain-containing protein
Protein accession: YP_001544744
Protein GI: 159898497
COG category: [S] Function unknown
COG ID: [COG4675] Microcystin-dependent protein
TIGRFAM ID:
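
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2421714
END_BP = 2424518


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2805
print(protein_length(length_bp))  # 934
```

The result does not depend on the strand, which this record leaves blank.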


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACAA TTGATGATGG GTTAATCGTT TATTTGCAAT TAGATGCGTT AAGTGCGGGC 
CAAACCGTTT TAGATATTTC GGGCAACAAC AACAATGCTA TCGCCCATGG CTCACTCCAA
CTCGTTTCCG ATGACGTGTT TGGCAGTGTA CTGGATTTTG ATGGCAACGG CTATCTCGAA
TTGAATCCCG CTAGCATTCC GGCTGGTAAT GCGATCACTG TGAGTTTTTG GGCCTGTGGG
GCAGAAAATT TGCCTGGCGC AGGCGTTTCG GCGATTGCTG CCTTGACCGC TGCTGATGAG
CGTTCGCTGA ATGTGCATTT GCCATGGAGC GATGGCCAAG TTTATTTCGA TTGTGGAGCC
GAAGGTAATA CTTGGGATCG TATGAACCAA GCGGCCAATG CGAGCGATTA CAAAGGCAGC
TGGACCTATT GGGCATTTAC CAAAGATGTT GCCACCGGCA CGATGCAAAT TTATGCTAAT
GGCACGTTGT GGGCGCAAGC CACTGGCAAA ACCTTGCCGA TTAAGGCTGT GAGCCAAGCC
ACCTTTGGTG CATATAGCCC CAATACAGCG CCTTACTACG GCAAACTTTC CAGCCTACGG
ATCTACAATC GCGCTTTGTC GGGCGATGAA ATTCTGCAGG TGATGGCGGC TGATCCAGCG
GCGGCAGCAA CTTTCGATAG TTTGTATCCG GTTGGTTTTA ATTTGCTTGA TGATGATGGT
CATGCGACGA TGTACATCGA CGATAATCCC CAAGGCTACA ACCTCTTTTT GACCATCAGC
AATAACTCGC CGCGCACAAT TAATCTGGTT GCGCCGAATG ATCAAACACC CTCGGCGAGC
AATCACCATT TTGTGCTGAA TTTTCGGCCT GGCACACTCT CAACCGCGAC GCTGAGCCAA
GTTAAGCTGA CCGACCAAAC GTGGCAACTC AGCACAGGGC AAGCGGTTGA TGGCACGACA
AGTTTGTATT TGCTGAGCAC TCAGGCACGG AGTTTGACCC CCAATCAAAG CCTGAATTTG
GCCTTACAAT CAATCAGTGC CTCGGCGAGC AGCGGCGCAC GCAATACGCG AGTCGAATTA
AGCTACCAGC AATTGCAATA TGCCAACAGC CTCACGCCAT TAATTGGCAG CCGTTTGCAA
AACATGGTGG TGTTGAATGC TCAAGGTGCA CATACACCGC CGTTGCATGT AGGCTTTGTT
GGGCCAAATA TGGTGCTAAA TAATGGCACG ACGGCGAATC AACTGACCTT ACATATTACC
AACCTGCTTG ATGATGCAGC GTTGGCGCTT AGCCCAAGCA CCAGCAATAG CCCAACCACC
TTCATCGTAG CCTTTGACAG CCAGGATAAC CAAAATAATC GTGATTGGGC CTTGGGCACG
ACCAGCCAAG TTAATGCGAT TGCAATCACG CTTGATAATT GGACAGGTGG CAAAGATCCT
GAATCGGCAG AATGGATTTT CACCACGACC CAAACCTCGC TGGCAGCAGA AGCCTTTATT
CAAATGAACC TGAGCAACAT CGTTTCATCG CTGCCAAGCG GCGCGGCCAA CCTCTACTTG
CTCTATGAAA ATTTGCCAGG CTATCGTGAT GGCTACTTTG TGGTAGCGAT TGAAAAAACG
CCATTGATCA TTACTAGCAA TCAACGAGTT GGGGTTGGCA CGAATGCCCC AACGCGGCCA
TTGGCGATTC GTGGCAGCGG CGCAGGCGAA GAAGCTATTA GTTTTGAAAA TAGCAGTGGC
ACAACCAAAT GGCATATTAA CCAAAAAGTT ACGACCCAAG GTGGCTTCAA TATTGCCGAA
ACAGGCGTGA AAGATGGGCG TTTATTTCTC CAAACAGGTG GTAATACTGG AATTGGTACT
GTCACACCAC GGACTGGTTT AGATACTGGC ACTGGCGTGC TGAGTGGTGC TGCCAACGAC
TATACCAAAG CCCAATTTGC CTTTACTGGC GGTGGTCGTG TAACCTGGAA CGGGATTAAT
AATCGCTTGA AATGGAGCCA ACGCTTTATT GCGATTTCGA TGGAACGATC AGTTTCGTTT
GCAGGTGGCC ACGTGAATGT CATTCAACCA ACCAGCGATC TTCCCGCGGC CCAAGTCTAT
GACAACCAAG TGCGTTCGTG TACTGCCGAT GGAATTGTGC TCAAGGCATG GGAAGCCTTA
TATGCGGTGC ACACGGTTGG GGGCAACGAA AATGCCGTCA GCTATCAGAT CACCTGTTAT
ACCACAGCCT CGTTCTTTGT GCCGAGCAAT TGGCTGTTGG TGGCGGTAGT TAATGGCGAT
GATGGCACAG TGCGTTTGGC TAATGGGCTT GTGCTCAATC CTGGGGCTAG CTCGCTGAAT
GGTAGTTCAA TTCCATCGGG CACAATTAAT ATGTGGTCGG GGGCAGATAA TGCCCTACCT
GGTGGTTGGC TCTTGTGTAA TGGCCAAAAT GGCACGCCCG ATTTGCGCAA TCGCTTTGTG
GTTGGCGCTG GGGCTGCCTA CCCTGTTGGT ACAACTGGTG GTGCGGATAG CGTGACGCTA
GCGGTGAATC AGATGCCTTC CCATAACCAT GCAGCCAGTA CATCAAACGA TGGTCAGCAT
AACCATACCC TCTACTTCGA TACTGGTGGT GGTGGTAACG GCCCTGGTGG CGATATGGCA
AAAACCAATG ATGGCTTGCA AAAAAATGTG ATTGCTAATT TCAGCGTCAA AACCGATAAG
GATGGTAATC ACTCACATAG CGTTACGATC CAAAATAATG GTGGCAACCA AGCCCATGAA
AATCGCCCGC CCTTCTACGC GCTTTGCTAC ATTATGAAAC AATAG
 
Protein sequence
MTTIDDGLIV YLQLDALSAG QTVLDISGNN NNAIAHGSLQ LVSDDVFGSV LDFDGNGYLE 
LNPASIPAGN AITVSFWACG AENLPGAGVS AIAALTAADE RSLNVHLPWS DGQVYFDCGA
EGNTWDRMNQ AANASDYKGS WTYWAFTKDV ATGTMQIYAN GTLWAQATGK TLPIKAVSQA
TFGAYSPNTA PYYGKLSSLR IYNRALSGDE ILQVMAADPA AAATFDSLYP VGFNLLDDDG
HATMYIDDNP QGYNLFLTIS NNSPRTINLV APNDQTPSAS NHHFVLNFRP GTLSTATLSQ
VKLTDQTWQL STGQAVDGTT SLYLLSTQAR SLTPNQSLNL ALQSISASAS SGARNTRVEL
SYQQLQYANS LTPLIGSRLQ NMVVLNAQGA HTPPLHVGFV GPNMVLNNGT TANQLTLHIT
NLLDDAALAL SPSTSNSPTT FIVAFDSQDN QNNRDWALGT TSQVNAIAIT LDNWTGGKDP
ESAEWIFTTT QTSLAAEAFI QMNLSNIVSS LPSGAANLYL LYENLPGYRD GYFVVAIEKT
PLIITSNQRV GVGTNAPTRP LAIRGSGAGE EAISFENSSG TTKWHINQKV TTQGGFNIAE
TGVKDGRLFL QTGGNTGIGT VTPRTGLDTG TGVLSGAAND YTKAQFAFTG GGRVTWNGIN
NRLKWSQRFI AISMERSVSF AGGHVNVIQP TSDLPAAQVY DNQVRSCTAD GIVLKAWEAL
YAVHTVGGNE NAVSYQITCY TTASFFVPSN WLLVAVVNGD DGTVRLANGL VLNPGASSLN
GSSIPSGTIN MWSGADNALP GGWLLCNGQN GTPDLRNRFV VGAGAAYPVG TTGGADSVTL
AVNQMPSHNH AASTSNDGQH NHTLYFDTGG GGNGPGGDMA KTNDGLQKNV IANFSVKTDK
DGNHSHSVTI QNNGGNQAHE NRPPFYALCY IMKQ
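
The protein sequence is the translation of the gene sequence under translation table 11. As a minimal sketch, the snippet below translates the first 30 bases and the last 15 bases of the gene sequence and compares them with the two ends of the protein sequence. It assumes the 10-base blocks are separated only by whitespace. It uses the standard amino acid assignments, which table 11 shares with the standard code (the two differ in their permitted start codons, which does not matter here because the gene starts with ATG). The helper names are illustrative.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the amino acid
# assignments of the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 15 bases, copied from the gene sequence above.
gene_start = "ATGACAACAA TTGATGATGG GTTAATCGTT"
gene_end = "ATTATGAAAC AATAG"

print(translate(gene_start))  # MTTIDDGLIV
print(translate(gene_end))    # IMKQ
```

Passing the full gene sequence to `translate` in the same way would allow a comparison against the complete 934-residue protein sequence.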