Gene Haur_1866 details


Gene Information

Locus tag: Haur_1866
Symbol:
ID: 5733755
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2201175
End bp: 2203613
Gene Length: 2439 bp
Protein Length: 812 aa
Translation table: 11
GC content: 48%
IMG OID: 641279010
Product: XRE family transcriptional regulator
Protein accession: YP_001544637
Protein GI: 159898390
COG category: [R] General function prediction only
COG ID: [COG3903] Predicted ATPase
TIGRFAM ID:
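
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2201175
END_BP = 2203613


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 2439
    print(protein_length(length))  # 812
```

Under these assumptions the computed values agree with the listed Gene Length (2439 bp) and Protein Length (812 aa).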


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGAAA GCTTCGGTTA CTGGCTTAAA CAGCGGCGTA AAGAGCTTAA TTTTACCCAA 
GAATATTTAG CCGAGTTGGT AAGCTGTTCA ACCATTACTA TTCGCAAAAT CGAGTCGAAT
GAACGGCGAC CTTCGCGCCA GATTGCGGCT CGGATCGCTA AATTTTGTCA AGTAGAAGCC
AATCGGGCCT TTGTTGATGC AGCATGGGCT GGGCAATCGC CTAGCCCATC TGATGGTGGC
TCGCCACCTG AGCCAGCTCC TTCGAACTTA CTGCCCCCAT TTAGTTCAAT CATTGGCCGC
GATTCGGCGA TTGAATCAAT TTGTGTCCAA TTTCAAGCCC AAAAAGCCCG TTTAGTTACG
ATTGTCGGCT CACCTGGCGT TGGCAAAACC CGCCTGGCCC AAGCCATTGG CCAACAGTTA
CTCACACATT TTAGCGATGG CGTTTTTTGG ATTAGCCTCG ATCCAATCGT CAATGCTAGC
CTTGTCCCAT CGTTAATTAC GCGGGTACTC GGCATTCACG AAAACCCCAA TCAATCGATC
GAAGAAACAA TTTTCAACTG GCTCAAAAAT CGCCATTTGC TCCTCATTCT CGATAATTGT
GAGCATATTA TTGAGTTGCG CCAGTTTGTA AACCAACTGT TAAGTTATTG TCCAACCCTC
TCGATTCTTG CGACCAGCCG CGAAGTGTTG CATTTGCGCT GGGAACAGCG CTTTCCATTG
CGCCCGCTGA CGGTTCCAGT ACGCGGTATG CAGCTTGATC TCGCGCAACT GGCCCAAATT
CCAGCGATTG CGCTATTTTT AGAGCGCAGT CGGGCGATCA ATCCTCAGGC CGAGTTGAAT
GCATCGAATG CCCGAGCAAT TAGCACGATT TGTATGCAGC TTGAAGGCTT GCCGCTAAGT
ATTGAGTTAA TTGCTGCGCG TAGCGCCATG CTCAGCCCTC AAATGCTGGT GCATCGGCTG
AATAATCAAT TGAATGTACT GACCCAAGGC TCGCGCGATT TGCCTCATCG CCAACAAACC
TTGCGCAATG CCATCCAATG GAGTATCGAT TTGCTCGATA GTGCCGAGCA ATTTCTGCTG
GTAGCGCTGG CATTAGCTCC CGAAAGTTGT ACCCTCCTGA GCCTAGAAGC GCTTGCTGAT
TGCTATAGCC CGTGGCCGTG GTCGATTTTC GATGGCCTAA CCAACTTGTT CGATAAAAGC
CTAATTTGGA TTCAGCAGCA GCAAACTGAT GAGCCACGCT TTGGGATGTT GCGGGTTTTG
CGTGAATATG TGCTTGAGCA GCTTGCTGAG CCAACAACCA TTCAGCAATT ACGCCAAAGT
TTTGCCAGCT ACTACCTGAA TATTGCCGAA ACCATTTATC AAAAGATGCT CAACTCGCGT
ACCAATAGCC TTTTTCAAGA GATTGCCGCT GAGTATTATA ATTTTCACAC CGTTATCACA
TGGTGCCTTG AACCACCATA TGATCTGGAA AATGCGATTA AGCTAATTGC GACGTTAATC
GATTTTCTAC ATCTCTATGG CTATCAACGC GAGGGGATTA GTTGGTTACA ACACATTTTG
GGCCTGATTG AACAACAAAC AGTCACGCTT AGCCCAGCGA TTCTGGCCGA TGCCTATAAC
GCCTTAGGCT TTTTATACTA CCATCAGGGC AATATTAACC AAGCCCAACA CTTTTTTGAG
CGCGTATTGG AGCTTATTGG CGGCCACACA TCGTTTAAAC ATGCACGAAT TTTGTATAAT
TTAGGTTTAG TTAAAAAGAA CAAAGGCGAA TTTCTTCAGG CCGAGGCCGA TTTACAAGCC
AGTTTAGCAA GTTGGCGCAC CCTTGGTTTA CAGCCAGGCG AAGCCTATTC GCTCTGGGGG
TTGGGCAGTT TAGCCCTCGA CCAAGGCCTC TATACTCATG GGTTAACCTA CCTGCAACAA
AGCTTGGCAA TTTGGCAAAC GCTTGAATCA ACTCATGGAC AAGTGATGGT GTTAAGTGAT
TTGGCCGAGT TAGCCTTACT ACAAGCCAAT CCGCATGAGG CTGAGCAAAT ATTAGCCCAG
ATTAAAACGA TTGTTGAGGC CAGCAATTAT ACAATCACAA GTTCACGTAT AGCCTTGCTC
GAAGGTAAAT GTGCGATGCA ACGCCACGAT TTTAGCCATG CCCAAACCTG CTTCGAAGAA
GCCGAGGAGA TCGCTGAAGA ACAGCAATCA ACCGCCTATT TAGCCAAAAT CCACCTCGAA
CAGGCTAAAC TGGCTTTGGT GCAGGCACAC TATCATCAGG CCAGTTATCA TGGCTATGAA
GGGTTGCGCC TAGCGACCAT GCTTGAACAT CAGACTGGGA TTGCCAAGGC CCACCACGTG
CTGGCCCAAG TCTATCAGCA GTTGGCCAAT CCGAGTCAAG CCGAGCAACA TTGGCAAGCT
TATGCAGCAA TTTATCAACA CGTTGGTTTA GTGCCATAA
 
Protein sequence
MSESFGYWLK QRRKELNFTQ EYLAELVSCS TITIRKIESN ERRPSRQIAA RIAKFCQVEA 
NRAFVDAAWA GQSPSPSDGG SPPEPAPSNL LPPFSSIIGR DSAIESICVQ FQAQKARLVT
IVGSPGVGKT RLAQAIGQQL LTHFSDGVFW ISLDPIVNAS LVPSLITRVL GIHENPNQSI
EETIFNWLKN RHLLLILDNC EHIIELRQFV NQLLSYCPTL SILATSREVL HLRWEQRFPL
RPLTVPVRGM QLDLAQLAQI PAIALFLERS RAINPQAELN ASNARAISTI CMQLEGLPLS
IELIAARSAM LSPQMLVHRL NNQLNVLTQG SRDLPHRQQT LRNAIQWSID LLDSAEQFLL
VALALAPESC TLLSLEALAD CYSPWPWSIF DGLTNLFDKS LIWIQQQQTD EPRFGMLRVL
REYVLEQLAE PTTIQQLRQS FASYYLNIAE TIYQKMLNSR TNSLFQEIAA EYYNFHTVIT
WCLEPPYDLE NAIKLIATLI DFLHLYGYQR EGISWLQHIL GLIEQQTVTL SPAILADAYN
ALGFLYYHQG NINQAQHFFE RVLELIGGHT SFKHARILYN LGLVKKNKGE FLQAEADLQA
SLASWRTLGL QPGEAYSLWG LGSLALDQGL YTHGLTYLQQ SLAIWQTLES THGQVMVLSD
LAELALLQAN PHEAEQILAQ IKTIVEASNY TITSSRIALL EGKCAMQRHD FSHAQTCFEE
AEEIAEEQQS TAYLAKIHLE QAKLALVQAH YHQASYHGYE GLRLATMLEH QTGIAKAHHV
LAQVYQQLAN PSQAEQHWQA YAAIYQHVGL VP
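
The sequences above are displayed in blocks of 10 characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch, not a reference implementation, of cleaning such a block, computing a GC fraction, and translating a CDS. The helper names are hypothetical. The codon table is written out by hand in the usual T, C, A, G ordering; it uses the standard codon assignments, which as far as I know are the same ones used by translation table 11 (that table differs in its permitted start codons).

```python
BASES = "TCAG"
# Standard codon assignments, ordered T, C, A, G at each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and newlines used for 10-character display blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    head = clean_sequence("ATGAGTGAAA GCTTCGGTTA CTGGCTTAAA")
    print(translate(head))  # MSESFGYWLK
```

The translated fragment matches the first 10 residues of the protein sequence listed above. Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the listed GC content.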