Gene Haur_4724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4724 
Symbol 
ID5736568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6033366 
End bp6035507 
Gene Length2142 bp 
Protein Length713 aa 
Translation table11 
GC content53% 
IMG OID641281889 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_001547483 
Protein GI159901236 
COG category[R] General function prediction only 
COG ID[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCTAC ACTGCTATTT TTTGGGGCAG CCCCAAATTG TTTACAACCA AACGGCAATC 
CATGTGACCA ATGCCAAGGC TGTGGCGCTG TTGGCCTATT TGGCATGTCA CAATCAACCG
CAACCGCGTG AACGAATTTT AGGCTTGCTC TGGGCTGAAA GTAGCAGTAG TGCTGCGCGA
AAGAACCTGA GCAATTGTCT TTGGCAGTTG CGCCAACAGC TTGGCGAGAG CTGCATCCTT
AGCCATGACG ATAGGCTGAG TTTGGGGGAG GCGGTTTGGA GCGATCTGCA GGCACTGTGG
CAATTGGATC AGCCTGATCA CGCAACCTTG CTCAATTGTT ATCGTGGCCC GTTGCTTGAT
GGCCTGAATG TACGCGATGC GCCAGAGTTT GAATTGTGGT TGTTGCACGA GCGCAATCGG
CTGCTGCAGC ATTATCTCCA ACTGCTTGAG CAAGCCTTAC AAACAACCAA TGATCAGCAA
CAAAAACTGC AACTTGCCAA TCATGGTTTG GGCGTTGACC CGCTGCACGA ACCATTCGTT
CAAGCGGCGA TTCAGGCTTG TTTGGCGCTT GAGCAACGGG CCAATGCCTT ACAGCACTAC
ACCAATTTCC AACATCAGCT TGAGCAACAA CTCGGCTTAG AGCCAGCCGC CGCCACTCAA
GCCTTGCGTC TGCAAATTCT CGGTACTTCC ACAAATCCAA CAATCAAGCC GCCGCATCTG
CCAACGCCGC GCAATACCAG CAAATTAATT GGCCGCGAGC CGGATTATGC GCTGTTGCAT
GCTGCTTGGC AACGCAGTTT GCAAGGCCAA CTTCAAGTGG TATTGCTCAC TGGTGAGTTG
GGCATTGGCA AAACCAGTTT ATGGCAAACA TGGCTAGGCC AAATTCGCGA GGGGCATCAG
GTTGTGGTTG CCCGTGGGCT GGAAATTACC CAATTGCTGC CATTCGCGCC CTTTTTGGCA
TGGACTGACC AAACCGCACT GCTTGATTGG CTGTTTGGAG CGCAATCGCC GCTATCGCCC
TACTGGCAAA GCGAATTGGC TCGCTTGATT CCCGATTTCA AACAGCGCGA GATTATTCCG
CCGCTGAGCA ATGCCACGCC TGCCGAAGAA CGCCGCCGCA TCTTCGAGGC CATGTTGCAA
GTACTGGCGC TGTTTGCCGA TCAGCCATTG GTATTGGTGC TCGATGACGC GCATTGGGCC
GATCAACTTT CGCTGGGCTG GCTGGGTTAT GCCGCCGAGC GCTTACAATC CAAACCTGTG
CTGATCATTT TGACCTATCG CGCCAACCAA GTACAGGGCG AATTGGCAAA TTTAATCTGG
CATTGGCAAC GCAACCATAT CGCCCAACGC CATGAACTAC AGCCGCTCAG CCCACGCGAA
AGCGAGCAAT TGGTATTGCA ACAAGGCGGC GATCAACAGC GGAGTGCCCA ACTGTATCGG
CGCAGTCGCG GTAATCCCTA CTTTTTACAT GAGCTATTAC ATGCTCCAAG CGAGCAAATT
CCTAGCTCGC TGGCCGATGT GGTGGGTTTG CGCTTAACCA ACCTGCCAAG TGCCGCCCAA
GCCTTGCTCT CTGCTGCCAT GATTTTGGGC ATCAACAGCA GGCTGGAACT GCTCCAACAA
GTCAGTCATT GCGCCGAAGA TCAAGCACTC GATATGCTTG ATCTGCTATT ACAGCAAGGT
ATTTTGGCCG AGCAAGCGGG CCAGTATCAA ATTGCCCATC CATTAATTGG CGAAGTTTGG
CAGCAGCAGC TGAGTCCTGC TCGTCGCCAA GTATTCCATC GGCGAGCTGC CGAAGCGCTA
GCCGAGAGCT ATGCAGGCAT GTTGCCGATG GTTGCTGTGC AGTTGGCAAG CCATTACGAA
GCGGCTGGCA AAGCCCACGA AGCTGCCCGC TATGCCGAAA TGGCAGGCTA CCATGCCTTG
ATTATGGCGG CTGGCTCAGA AGCAGTTGTG TTGTATCAAC GAGCGTTAAC CTTAGAGCCA
ACGCCATTGC GCCAACTTGG TTTGGGGCGA GCTTGGGTAT TGCAAGGCGA TTTAGCTGCT
GCTCGGGCGA ATTTCAATGC AGCGCGGCAA GCCTTTGAAT TGCTTGGTGA TTTTGAGGGT
GCGGCCCAAG CAAGCCTTCA ATTAGCAGCA AGTTATCAAT AG
 
Protein sequence
MDLHCYFLGQ PQIVYNQTAI HVTNAKAVAL LAYLACHNQP QPRERILGLL WAESSSSAAR 
KNLSNCLWQL RQQLGESCIL SHDDRLSLGE AVWSDLQALW QLDQPDHATL LNCYRGPLLD
GLNVRDAPEF ELWLLHERNR LLQHYLQLLE QALQTTNDQQ QKLQLANHGL GVDPLHEPFV
QAAIQACLAL EQRANALQHY TNFQHQLEQQ LGLEPAAATQ ALRLQILGTS TNPTIKPPHL
PTPRNTSKLI GREPDYALLH AAWQRSLQGQ LQVVLLTGEL GIGKTSLWQT WLGQIREGHQ
VVVARGLEIT QLLPFAPFLA WTDQTALLDW LFGAQSPLSP YWQSELARLI PDFKQREIIP
PLSNATPAEE RRRIFEAMLQ VLALFADQPL VLVLDDAHWA DQLSLGWLGY AAERLQSKPV
LIILTYRANQ VQGELANLIW HWQRNHIAQR HELQPLSPRE SEQLVLQQGG DQQRSAQLYR
RSRGNPYFLH ELLHAPSEQI PSSLADVVGL RLTNLPSAAQ ALLSAAMILG INSRLELLQQ
VSHCAEDQAL DMLDLLLQQG ILAEQAGQYQ IAHPLIGEVW QQQLSPARRQ VFHRRAAEAL
AESYAGMLPM VAVQLASHYE AAGKAHEAAR YAEMAGYHAL IMAAGSEAVV LYQRALTLEP
TPLRQLGLGR AWVLQGDLAA ARANFNAARQ AFELLGDFEG AAQASLQLAA SYQ