Gene Haur_5163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5163 
Symbol 
ID5737121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp235604 
End bp238513 
Gene Length2910 bp 
Protein Length969 aa 
Translation table11 
GC content49% 
IMG OID641282328 
Productsignal transduction protein 
Protein accessionYP_001547919 
Protein GI159901673 
COG category[T] Signal transduction mechanisms 
COG ID[COG5635] Predicted NTPase (NACHT family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGATA CTACCATCGG TTCGGTCAAT ACCAATTATT CAATGATTGA TGGCCCCGTT 
GTTGGCATTA ATTTAGGCAC GATTATCTAT GGACGGGCAC CAGAGGAAGG TGAGCGGCAG
AGCTTAGTCC GATATTTGGA ACAACTTTCC AACAGCCATC GCAAAATCCG AGTCATTGGA
CTTGGTCCTT CGCGCCTTGA ATCGGGCATT GATCTTGCAT CCGTCTATAT TATGCTGGCG
GTGCAAAAAC GGTATCGCAT TGTTCGCAAA CTGACTTCGT TTGAAATTAT TGACTATCAA
CGTCAAAAGC TCAGGATTCC CCATGAGTTG AAGCCTGATC GCTGTTTACC CGATCAGGCG
ATTATCAAGA TTGGGAAACA TCAACGATAT GGTTGGTTGA TGTTCCGCGC GGAATTAGCG
ACTGAAACCA TTGCGCAGTA TCAGTATCTC ATCCTCTGCG GTGCGCCTGG GAGTGGAAAA
TCTACATTTG CTAAACACTT GGTCTGGGCA TTGGCGCAGC GGGGACTTGA TCAGATCAAT
CATCAAACAC ACCTCCGTGG TTGGACTGAT AAACGGCAGC TTCTCCCTAT CTTTATGCCA
CTACGGCAGC TGGCAGGAGC TTTAGCGGGC AATGATCTGG GTTTGCATGC TGAACCAAAA
ATTGGGTTAT TGCTCGATGC ACTCTGTGAC TATTTACAGA CACACTATGG GTTAGATGAA
CCACGTACCC TGTTAACGGC TGGTTTGAAC CAGCGTCACA AGGTCTTGTT TGTGTTTGAT
GGACTTGATG AAGTTCCGGT TGAAGCCAAT GAGCATAGCC TTGATCGCGC GTCGTTGCTG
CGGTTCTTGC GGATTTTTGC CGATCATCAG CCGAACGCTC GTATGCTTAT TACCTGTCGT
TCACGGGCGT GGACATCGGA ATATCGCATG ATCACCCAAT GGCCGATGCA CGAGTTAGCT
CACTTGACTG GGGGGCAAAT TACTCATTTT GTTCATTATT GGTTTCCGCA GTTGGTATTA
AGTGGGGTTA TTGGTCATGA CGAAGCACAG CGGTATAGTA CCGAACTTTT GAAAGCCTTA
CAGCACCCCA AACGCCAAAA ATTACGGCTG ATGGCAGAAA ATCCCTTGTT ACTAAGTATG
ATGATTTTTG TACTGGCCGA AAATGGTGTC TTGCCCCGCG ACCGTCATAG TCTCTACGAG
CAGGTGCTCG GTCAATTGTT GGGGCAATGG GATGCGAAAC GTGAAGGGGA CAACTTAGGA
CAAGCCATCG GCGATGAACG AATTACCAGC CAAGAGCTCC GTAATCGGGT GCTTGATCGG
CTGTGCTATC ACGCACATAT GCAGGCCTTG TCGGTTGATG GGCGTGGACG AATTCATGGC
CGTGAGCTGC GCCTTGAGTT AATGGATTAT TTTAATCGGG TCAAAGTTGC CGATCCCTAT
CGGGCAGCGG AGCGCTGTAT TGCCTATATC GATCAGCGTA GTGGTCTATT ACATCCGGAA
GATGCAGGTA TGGTGTATGC CTTTGCGCAC CTGACCTTGC AAGAACATAG TGCAGGCCGC
CATTTGTTGT TTTATGAATC AATTGGACAA ATTTTAGCCT TACGGCATGA TGATCGGTGG
CGTGAGCCGA TCTTTTTAGG TGTAGGGTGC TTGACGAGTG AAAGTTTAGG ATCAAGCAAG
ATCAGCGAAC TGTTAACCGC ATTAATCGAT CGCTATGACT ATGGAAGTGA TACCTGCAAA
CCGTCTCATG TATGGTATCG CGACGTAGTG CTGGCTGCTG AATTGGGATT TGATCGTGAT
TGGGGGTTAT TGAGCGGTAC AGGGATTGAT GTTCGCCGTA TTAAACGCGA AATACGGCTC
GGCGTAGTGC AAATGCTTCA TGATCGTCAG CATGCGCAAT CCGCTCTTGA GTATTTCTAT
GGGGCAGCCA TGAAACCGAC ACCGCTCTTA GTCAAGGAAC GCCAACACGC TGCCGAATTG
TTGGCAGGGC TGGGAGACCC ACGCTATCCT ATCGATGGAA CGCAATGGCA GCAGGAGACA
ACACACCTAT CACAACAGTT CGGACGCGAG GGGACCCATT ACTGGCGCTA TATGCCTGCG
GGTCAGTACC AGCGTGGTGA CGCAGGAACA GACATAGCCG AGACCATAGA AAAACATCTG
GGTGATTTTG ATCCTGCTGG GATGAATCAC CGTGGTCAGG GTGATACTCA GATCGGGGAT
GTAGGGGTTG TCCCTCATCC ATATTGGATT GGACAATTTA TGGTGACGGT GGAGCAATAC
CAGGCCTTTA TCCAAGCAGG AGGGTATCAC ACTGATCGAT GGTGGTCGAC GCATGGCAAG
GCATGGAAAA CAATGATTGC ATGTAGCGAG CCTTGGTGGT GGGAACAACA AACGCTCCAG
CAATATATCA ACCAACCCAT CTATGGAGTA AGTTGGTATG AAGCCGTGGC ATATTGTAAC
TGGCTGAATC ACTACCTCCA GCCAATGCTA CCAGTAGGCT ATCGCGTCTG TGTACCAAGT
GAGACGGAAT GGATGAGCGC GGCCTATAAT GATGAACATG GGCAATTTCA TAACTACTCA
TGGGGAAATC AGCCCTTAAC TCCTGAGCAT GCAGTCTATG ATTGGGTTGA GGAACGGCGG
CCAGCCCCAG TCGGCTTAAG TAGGATGGGT GATGCACCAT GTGGTGCTGC GGACATGACA
GGCAATCTGT GGGAATGGAC AGCGACACTG GATGGAAAGC AGGACGAGCA CATTGATGAA
TCTGCGGTGA ATGATGCCTG TTTGATCACA CTACGCGGTG GATCTTGTTA TGATAATGTT
ACAACGATTC TTTTTGCTGC GAATGATACA TCGCTCCCGA TAAATGTTAG TTACAATCGT
GGATTTCGGT GTGTGATTGC CCGGCGTTGA
 
Protein sequence
MPDTTIGSVN TNYSMIDGPV VGINLGTIIY GRAPEEGERQ SLVRYLEQLS NSHRKIRVIG 
LGPSRLESGI DLASVYIMLA VQKRYRIVRK LTSFEIIDYQ RQKLRIPHEL KPDRCLPDQA
IIKIGKHQRY GWLMFRAELA TETIAQYQYL ILCGAPGSGK STFAKHLVWA LAQRGLDQIN
HQTHLRGWTD KRQLLPIFMP LRQLAGALAG NDLGLHAEPK IGLLLDALCD YLQTHYGLDE
PRTLLTAGLN QRHKVLFVFD GLDEVPVEAN EHSLDRASLL RFLRIFADHQ PNARMLITCR
SRAWTSEYRM ITQWPMHELA HLTGGQITHF VHYWFPQLVL SGVIGHDEAQ RYSTELLKAL
QHPKRQKLRL MAENPLLLSM MIFVLAENGV LPRDRHSLYE QVLGQLLGQW DAKREGDNLG
QAIGDERITS QELRNRVLDR LCYHAHMQAL SVDGRGRIHG RELRLELMDY FNRVKVADPY
RAAERCIAYI DQRSGLLHPE DAGMVYAFAH LTLQEHSAGR HLLFYESIGQ ILALRHDDRW
REPIFLGVGC LTSESLGSSK ISELLTALID RYDYGSDTCK PSHVWYRDVV LAAELGFDRD
WGLLSGTGID VRRIKREIRL GVVQMLHDRQ HAQSALEYFY GAAMKPTPLL VKERQHAAEL
LAGLGDPRYP IDGTQWQQET THLSQQFGRE GTHYWRYMPA GQYQRGDAGT DIAETIEKHL
GDFDPAGMNH RGQGDTQIGD VGVVPHPYWI GQFMVTVEQY QAFIQAGGYH TDRWWSTHGK
AWKTMIACSE PWWWEQQTLQ QYINQPIYGV SWYEAVAYCN WLNHYLQPML PVGYRVCVPS
ETEWMSAAYN DEHGQFHNYS WGNQPLTPEH AVYDWVEERR PAPVGLSRMG DAPCGAADMT
GNLWEWTATL DGKQDEHIDE SAVNDACLIT LRGGSCYDNV TTILFAANDT SLPINVSYNR
GFRCVIARR