Gene Haur_3995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3995 
Symbol 
ID5735856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5100143 
End bp5102119 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content52% 
IMG OID641281145 
Productputative transcriptional regulator 
Protein accessionYP_001546755 
Protein GI159900508 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.480395 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCAGC CAGACTTGTT GGCCTACAAC CTAGCAACGC TCGAAAAAAT GCTGGCGGAA 
GCTTTGCGCC ATGCCCGTGA GGGTGATGTG TTGTTTTTGG CGAGTTCGCC CTTAGCAACC
TGCCAGATTA TCGAACGTTG GTATGTTGCT GATACGCCCC AAGGTTCGGC TGAGCTTGGG
CGGGCAGTTC AGGCGCTGTT GTGGTGGCTG ATTGAGCGGA TTCGCCCCCA TGGTGAGCGC
CATTGGCTGG CCTTGCCTTG GCGAGCATAT AACGTGTTGG CGGGCTTTTA TATCGAAGGC
TTGCGGATTG CCGATCTCGC CGAGGCCATG GGCGTGGTCG AACAAACGAT CTATCCAATT
CGCAACCAAG CAGTGCAAAG CTTGGCCAAA TTGCTGCTCG AAGAGTTACA GCAACCAAGC
ACCACACTTC CCAACCATGC GATTATTGGG TTGTACCCGC AATTAACGGT TGCAGAACAA
ACCCTTGCCC GAATGTTGGC CTTCAATCAG CAACCACTAG CCCAACGTTG GCTGGAGCAT
TGGCTTGAAT TAAATGCGAT TGCCGAACCC ATAATTCAAC AATTAGCCGA TCATGGCCTG
ATTAAAAGTG AAAGTTTGGC CTTGGTTGAA AAGCTGCGGC CCTTTTTGCA AAGCCAAATT
AGCGCTCCCG AACGTCGGCG CTGGCATCAA CAATTAGCTG AATTATTACA ACCAAGCGAG
CCACTCCAAT CAATTCAGCA TTGGTTGCAA GCCCAGGCCT ATGATCAAGC AGCCAGCCTA
ATCATCGCTG AGCATCAAAC GATTGTTGAT AATTTGCAGG GTCGCGCACT GCGCGAATTG
TTGGCCCAGA TTCAAGCCCA CGATATTCAA CAACCACAGC TGTGGTTTCG GCTCAAGTTG
GTCAGTGGCG ATGTGGCCAT GACCATGAAC GATGTCCAAA CCGCCTTGGG CGAATATCAA
GCGGCTTTGG CTGCCAACGA GCCGTTGCTC AAAGCCGAGG CCTATTACAA GCGGGGCAAG
GCTTTTCGCT CGCAAAGCAC CATGGAAGCG CAAGCCCACT TCAACTATAG CATTGCGATT
TTGGAACGGG CCGCACCCAA CGACCCCTTG CGCTATCGGG TTTGTTTGGA GCAAGCATGG
ATGTGGTTTC AGGATCAGCG TGATTTTGAG CAGGCCCAAA CCAGCCTTGC GCAAGCCGCT
AGCTTGATCG ATCCGCTTGA TCGTGGGGCT TGGGCCGAGC TGGCTAACGC CCGCGGGATG
TTCTACGCTC ATCGCGGCGA GCATGCTGAG GCGATCAACC AGCATCAAGC GGCATGGTTG
GCGGCCAATG AGGTGAATCA TAGTTTACTT ATGACCCATA TCGCCCATAA TTTAGGCTAC
GATTATCTTG ATTTGGGCCA CTACTCGCAA GCGCTTGATT ATTTTGAGCA AAGCCTGAAC
TTGGCTAATC GCACGGGCAA TCGGCGCATG CAAGGCCTAT GTCAAAAAAG CATTGGGGCA
TGTTGCTTCT GGATGCAAGA ATTTACCCTA GCGGTCGAGC ATTATCTGGC GGCCTATCAG
ATTTTTGCGG CCATGCACAA CCACAATTGG CAAGCCAACA CTTGCTACGA TTTAGCCGAA
GCCTATGCTG AGCTTGGCCA AAGCCAACTG ATGCGCCATT ACTATGCCGA GGCGATTCAA
TTGGCCCAAG CGAGTGGCCT TGATCGCTTG TTAAACGATC TGCATGGCTT GGCCGAAAAC
TACCCGGGCC TGTACCCACC AACGATCGAA TTAAACGAGC GTCAGCAGCG AATTTTCGAT
TATCTCAAAC AACATCCTTC GATCACCAAC CGCGATTATC GTGAGCTAAC CCAAATTTCG
CCCAAGCAAG CCGCCCGCGA TCTGAATGAT TTGGTGGAGC GCAATGTTTT AGTGCGTTCT
GGTGATGGCC GTTCAACCAG CTACCAACTG CCTCAATCGA AGGCCAACGA AGCTTAA
 
Protein sequence
MNQPDLLAYN LATLEKMLAE ALRHAREGDV LFLASSPLAT CQIIERWYVA DTPQGSAELG 
RAVQALLWWL IERIRPHGER HWLALPWRAY NVLAGFYIEG LRIADLAEAM GVVEQTIYPI
RNQAVQSLAK LLLEELQQPS TTLPNHAIIG LYPQLTVAEQ TLARMLAFNQ QPLAQRWLEH
WLELNAIAEP IIQQLADHGL IKSESLALVE KLRPFLQSQI SAPERRRWHQ QLAELLQPSE
PLQSIQHWLQ AQAYDQAASL IIAEHQTIVD NLQGRALREL LAQIQAHDIQ QPQLWFRLKL
VSGDVAMTMN DVQTALGEYQ AALAANEPLL KAEAYYKRGK AFRSQSTMEA QAHFNYSIAI
LERAAPNDPL RYRVCLEQAW MWFQDQRDFE QAQTSLAQAA SLIDPLDRGA WAELANARGM
FYAHRGEHAE AINQHQAAWL AANEVNHSLL MTHIAHNLGY DYLDLGHYSQ ALDYFEQSLN
LANRTGNRRM QGLCQKSIGA CCFWMQEFTL AVEHYLAAYQ IFAAMHNHNW QANTCYDLAE
AYAELGQSQL MRHYYAEAIQ LAQASGLDRL LNDLHGLAEN YPGLYPPTIE LNERQQRIFD
YLKQHPSITN RDYRELTQIS PKQAARDLND LVERNVLVRS GDGRSTSYQL PQSKANEA