Gene Haur_3806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3806 
Symbol 
ID5735670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4777512 
End bp4778783 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content50% 
IMG OID641280958 
ProductSARP family transcriptional regulator 
Protein accessionYP_001546570 
Protein GI159900323 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0119583 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACAGG TAGTACGTGT TTTTCATCAA CTCTTGCCCG ATTTTGTTGA TCATGTGTCC 
GAGCAAATTG TTGCTAACCA TGTGCCTGTG TATGCCAGCT TGCCGCAGCA GCATGTCAAA
ATGGCGCTCT ACAATGCCAT TCATTCAATC GAGATCGATT TAGCCCAAGG CACAACCTCA
ACCTATGCCG ATTATTGGCG TGAGGTGGCG GTGCAACGTG CCCAACAAGG CATTTCACCA
GTTCACAGCA TGCTCGTTAC CCATCTTTCA ACCAATGTGA TGACCCAATT TTTGAAGCAA
GCCTTGGATC GTGAGCCAGA AGCCTTGGCC TGGTGGCTCG AACGTACCCA CACGATTATT
TCGCTGGGCA TGTTGGTAAT GACCGAGGCA CGAATTAATG CCTTGCAACA ACTTGGGCAG
CTAGGAAGCG AGCCAGCCAG CAGCGGGATT ATCATCCACG AACAGCCCAA TTTGATTTTG
CCACCTAGCC GCTGGCAAAC CCCACAAGTG TTGCATTTGC AAGCCTTTGG CCAAATGCGG
GCCTGGCGTG GCTCAAGCGA AGTGCTCAAT TGGGGTCGTA AATCGGCAAT CGCCTTGCTT
GGGATTTTAA TTACCCAACG CGGCCAATGG ATTCAGCGTG AGCAAATTTG CGATCTGTTC
TGGCCTGATT TAGCCCCAAA TCAAGCCGAA GCCCACTTTA AAGTTGCCCT GAATGCGCTA
ACCGCTGTGT TGGAGCCAGA ACGCCCGACG CGCCAAGCCT CAAGCTATAT TCAACGCCGT
AATACTGCCT ATCGTTTGGC GTTTGATACT GCGCCAATTC AGCTGGATGT GCTACGCTTT
GAGCAATTGC TGCAACGTGC CAACCACGCT AGCAATCCAC TAGAAGCCAT TAATTACTAT
CGCCAAGCAC TCAATTTGTA TGCTGGCGAT TTTTTGGGCG ATTGTTTGTA TAGCGATTGG
GCGAATGCTG TGCGTGAGCA ATTGCGCCAT CATTTTGTGC AAGCTGCCTG CGAATTAGCC
CAATTATTGT TGGCTGAACA GCAACCAACC GAAGCCTTGG AGTGGGCCGA AGCCGCTTTA
CAAGCTGATC CCTATCAAGA AAACGCCTAT CAAGCTAGCT TTATGGCCTA TGCCCAACTT
GGCAATCGGG TGCAATTGCA ACGCAGCTAT CAACGTTGCC AACAAGTGCT TGAGCACGAT
TTGGGCTTAG CTCCAATGCC CACCACCAAA GCCGCCTACC AACGAGCCGA ACAAACGCTG
CATCAACTGT AA
 
Protein sequence
MQQVVRVFHQ LLPDFVDHVS EQIVANHVPV YASLPQQHVK MALYNAIHSI EIDLAQGTTS 
TYADYWREVA VQRAQQGISP VHSMLVTHLS TNVMTQFLKQ ALDREPEALA WWLERTHTII
SLGMLVMTEA RINALQQLGQ LGSEPASSGI IIHEQPNLIL PPSRWQTPQV LHLQAFGQMR
AWRGSSEVLN WGRKSAIALL GILITQRGQW IQREQICDLF WPDLAPNQAE AHFKVALNAL
TAVLEPERPT RQASSYIQRR NTAYRLAFDT APIQLDVLRF EQLLQRANHA SNPLEAINYY
RQALNLYAGD FLGDCLYSDW ANAVREQLRH HFVQAACELA QLLLAEQQPT EALEWAEAAL
QADPYQENAY QASFMAYAQL GNRVQLQRSY QRCQQVLEHD LGLAPMPTTK AAYQRAEQTL
HQL