Gene Haur_5122 details


Gene Information

Locus tag: Haur_5122
Symbol:
ID: 5737080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 162686
End bp: 165547
Gene Length: 2862 bp
Protein Length: 953 aa
Translation table: 11
GC content: 56%
IMG OID: 641282287
Product: signal transduction protein
Protein accession: YP_001547878
Protein GI: 159901632
COG category: [T] Signal transduction mechanisms
COG ID: [COG5635] Predicted NTPase (NACHT family)
TIGRFAM ID:
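
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information table for Haur_5122.
START_BP = 162686
END_BP = 165547
GENE_LENGTH_BP = 2862
PROTEIN_LENGTH_AA = 953


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: 165547 - 162686 + 1 = 2862 bp, and 2862 / 3 - 1 = 953 aa.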


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGATA GCACTGATGG TTCCGTCAAC GCTGATGATT CTGAGTTCTA CGGCCCCGTG 
GTGGGGGTCA ATCTTGGCAC GATCATCTAT GGCCGTCCAC CCGAAGATGC CGAGCGCCAA
CGCTTAGTGG CCTATTTGGA GCAGGTGACG AAAAGCCACA ATACCTTGCG GGTAGTTGGG
GTTGGCTCGT CGCATCTCGC GTCAGGCATT GACCTCGCAT CCGCCTATAT GATGCTAGCG
GTGCAGGGAC GGCAGCGGAT GCTGCGGCCA CTCACGGCGG AAGAAGTCGA GGCATATCAG
CAACACAGGT TTAAAATTCC CAAGGAACTG AGTGCTGATC GCTGTTTGCC CGATCACGCC
GTGCTTGCGG TCGTTGAGGA TAGTCAGTCT GGTCAATTGG CGTTGTTTCG CGCTGAGTTG
GCCACGGAAA CCGTCTTAGC GCATCCCTAC CTCGTGCTGT GTGGCCCGCC GGGGAGCGGG
AAATCAACCT TCGCCAAGCA TCTCGTGTGG GCTTTGGCCC AACGTGGCCG TGACCAGATC
AACCACCATA CAGGCTTACT GGGCTGGAAT GATCACCAGC GCGTGTTGCC CGTGTTCATG
TCCTTGCGAA CCTTGGCAGG CGCATTAATT GGGAAGGATT TAGGGTTGAC CGACACACCA
AACATTGGGC TGTTGCTCGA TGCGGTGTGT GCGCACCTGC AAACTAAGTA TGGACTTGAA
CAGCCGCGCG AGCTGCTGAA GGCGGGGTTA AAAGGTTCGC TGACGGTGTT GTTTGTCTTT
GATGGCTTGG ATGAAGTGCC ACTGGAGGCG ACCGCAGCGA GCCTTGATCG TCGCTCGCTG
TTGACCTTTG TGCGGTTGTT TGCCAGTGCC TATGCTGCTC GTATCCTCAT CACCTGCCGC
TCGCGGGCGT GGACAGAAGA CTATCGCCAG ATCACGCAGT GGCCCATGGT CGAATTGGCT
CCGCTGAGCG GTGGCCAAAT GACCCAGTTT ATCAATACAT GGTTTCCGTT GTTGCACGCC
AAGGGCGTGA TTGAGCACGA GGCCATTGCA CGTTATGGTG CGCAGTTGAT GCAGGCGTTG
CGCGATCCCC AGCGCCGCCG CTTACGGGAC ATGGCCGACA ATCCGTTGTT GCTGAGCATG
ATGATCTTTG TGTTGGCTCG CAAGGGTGTC TTGCCGCGTG ACCGCCATAG CTTGTACGAC
GATATTCTGA AGCAACTCTT GGGCGAGTGG GATACCACCA GTCGCAATGG GCAGAATTTG
GGGCAAGCGG TTGGGGATGA TCGGATTACG GGCGACGAGG TGCGCGATCA GGTGTTGGAT
CGGTTGTGTT ATCAGGCGCA TTTAACCGCC ACGTCAGCGG ATGGGCGGGG GCGGATTCCA
AGCCGCGAGC TTCAAATTGC CTTGATGGAG TATTTTGCAC GCGTCAACGT GGCTGATCCC
TATCGAGCGG CAGAACGCTG TGTTGCCTAT ATCGATCAAT GCAGCGGCTT GCTTCAGCCC
GAGGATGAGG GGATGGTCTA TGCCTTTGCC CACTTAACCT TGCAAGAACA GAGCGCTGGT
CGCCACTTGG TGTTTTCTGA ATCGCTCGAT CAATTGTTGG CCTTACGTCG TGATGACCGT
TGGCGTGAGC CGATCTTCTT AGGGGTTGGC TGCCTGACCA AAGCGAGGCT TGGCAGTGCC
AAAATTGAGC AACTCCTGAC AACGTTGGTT GATTCTGATG CCTATGAAGC GGGGGAGATG
CACCAATACG ATTGGTATCG TGATCTGATT TTGGCCGCTG AGTTAGGCGC GGATTGTGAT
TGGGGCTTGC TGCATGGCAA GCAGATCAAG GTGGATCGCA TCCAGCGACG GTTGCGGGCG
GGGCTGGTTA ACCTGCTTGA AGACTATGAC CATGCGCAAG CGGCGCTTGC CTATTATAAC
GGTCAAGCGA TGGAGCCAGC GCCGTTGTTG GTGCGTGAAC GGCAAAAGGG TGCCGAACTC
TTGGCAGGTT TGGGTGATGC ACGTTATCCG GTGAGTATCG AGCAATGGCA ACAGGTAACC
TGCCAGCTTT CCACCCAGTT TGGTCGCGAG GGTACTCATT ATTGGCGGTA TATCCCCGCA
GGCTGCTATC GGGTTGGTGG TTGGGATGGA GATGAACAAG CCACAACCGT CGAACTTCCA
TCCTACTGGG TCGGACGATT TATGGTGACC GTTGATCAAT ATCGGGCGTT TATCGAGGCA
GGCGGCTATA CCAACGATGC ATGGTGGACA ACGCAAGGCT TAGCTTGGAA AAAGGAAACA
AACCGAACAG AACCATGGGG TTGGAATGGT CAAATCGAGC AGGAATACCG GAATCAGCCT
GTTTATGGGG TGAGTGGGTA TGCAGCGATG GCCTATTGTC AGTGGTTGAG CGAGCAGCTT
ACGCCATGGC TGCCGCAGGG GTATTGCATT CGGTTGGCCA GTGAGGCGGA ATGGGAAGTT
GCAGCAGCGT ATAATGCCGA TGGCCAGCGC CATACCTATC CGTGGGGCGA GCAGCCTGCC
ACACCGGAGC ATGCGGTCTA CGATTGGAGC GATGAACGGC GACCGCTATC AGTGGGTTTA
GGGCTGCTGG GCCAAGCGGC TTGTGGTATG CTGGATAGCG TTGGGAACCT GTGGGAATGG
GCCGCCGTGC GGTATCAGGA CAATGGTGGC GATAGGCAGC AGGTGCTTGC GGATAGTAAC
GATTGGATGG TACTGCGTGG TAGCTTATAT TACAACAATA GTACAAAGAT TCTTTGCGCG
GCGCGTGACT GGTGTCGTCC CGACGACGAC GACGTCTACA ACTGCCCTGG ATTTCGTTGT
TTTTTAGCCC CTCGTTCATA TGTTTTGCAT GCTGCATCCT GA
 
Protein sequence
MADSTDGSVN ADDSEFYGPV VGVNLGTIIY GRPPEDAERQ RLVAYLEQVT KSHNTLRVVG 
VGSSHLASGI DLASAYMMLA VQGRQRMLRP LTAEEVEAYQ QHRFKIPKEL SADRCLPDHA
VLAVVEDSQS GQLALFRAEL ATETVLAHPY LVLCGPPGSG KSTFAKHLVW ALAQRGRDQI
NHHTGLLGWN DHQRVLPVFM SLRTLAGALI GKDLGLTDTP NIGLLLDAVC AHLQTKYGLE
QPRELLKAGL KGSLTVLFVF DGLDEVPLEA TAASLDRRSL LTFVRLFASA YAARILITCR
SRAWTEDYRQ ITQWPMVELA PLSGGQMTQF INTWFPLLHA KGVIEHEAIA RYGAQLMQAL
RDPQRRRLRD MADNPLLLSM MIFVLARKGV LPRDRHSLYD DILKQLLGEW DTTSRNGQNL
GQAVGDDRIT GDEVRDQVLD RLCYQAHLTA TSADGRGRIP SRELQIALME YFARVNVADP
YRAAERCVAY IDQCSGLLQP EDEGMVYAFA HLTLQEQSAG RHLVFSESLD QLLALRRDDR
WREPIFLGVG CLTKARLGSA KIEQLLTTLV DSDAYEAGEM HQYDWYRDLI LAAELGADCD
WGLLHGKQIK VDRIQRRLRA GLVNLLEDYD HAQAALAYYN GQAMEPAPLL VRERQKGAEL
LAGLGDARYP VSIEQWQQVT CQLSTQFGRE GTHYWRYIPA GCYRVGGWDG DEQATTVELP
SYWVGRFMVT VDQYRAFIEA GGYTNDAWWT TQGLAWKKET NRTEPWGWNG QIEQEYRNQP
VYGVSGYAAM AYCQWLSEQL TPWLPQGYCI RLASEAEWEV AAAYNADGQR HTYPWGEQPA
TPEHAVYDWS DERRPLSVGL GLLGQAACGM LDSVGNLWEW AAVRYQDNGG DRQQVLADSN
DWMVLRGSLY YNNSTKILCA ARDWCRPDDD DVYNCPGFRC FLAPRSYVLH AAS
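
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is an illustration, not the database's own method: it builds the amino-acid assignments of translation table 11 (which are the same as those of the standard code; table 11 differs in which start codons it permits), strips the 10-base grouping used in this record, and translates the first 30 and last 42 bases of the gene. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order. The amino-acid assignments
# for translation table 11 match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 42 bases of the gene sequence in this record.
FIRST_30 = "ATGGCTGATA GCACTGATGG TTCCGTCAAC"
LAST_42 = "TTTTTAGCCC CTCGTTCATA TGTTTTGCAT GCTGCATCCT GA"

if __name__ == "__main__":
    print(translate(FIRST_30))  # MADSTDGSVN
    print(translate(LAST_42))   # FLAPRSYVLHAAS
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the reported GC content, which is not recomputed here.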