Gene Haur_1099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1099 
Symbol 
ID5732990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1259478 
End bp1260557 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content48% 
IMG OID641278237 
ProductArsR family transcriptional regulator 
Protein accessionYP_001543875 
Protein GI159897628 
COG category[K] Transcription 
COG ID[COG0640] Predicted transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0967799 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAC TCGTTCGCGC TCAACCGGCA TTTAAAGTTG ATTTTGTCCC ATCACTTGGG 
TTGGATCTGC TTTCGACAAT GGGTCTGATT GGAATTGTCC ACGATTTTGA AGGCTTGGAT
GCATGGCTGG TTGAGGCTGC TGCGAGTGTG CCCCCGCGCT TACGCCACGA TATTCAGCTG
GCAATGCGCA TGGGCGTTTA TCCCTATGTG GTGGTCGAAA CCGTCTCTGA CCAAATTTTA
CAGCCTGGTG CTGCTGGCCA CGACGATTTT AATGGCTTGA TTGAAGACCT CAAAGCGCTT
TCACCGCAAG AATGTGCCGC GATGGTACAT AAAATCGTGC AACGCACCGC CGCCAACGCT
GATGTTGAAT TATTGCACAC GCCAGCCGAA ATTATTGCCG ATCAAGAGCA ATTAGAAGAA
TTATTGGCTA AAATGCAGTT TCCGGTCGAT ACCGATGAGC TGATTGAATT ATTGCAACAG
CCAACCGAAT GGCGCGATTT ATTGGTCTCA ACGATTCAGC GCTTTTGGGA CCGGATTTAT
CGTGAGCAAT ATGAACTGCA ACAAGCCCGC CGCGAACGTA ATGCCCATTA TCATCGCACA
CATCAATATA GCGTCAACTT CCGCGATTTA TTTGCTGGAG TAACTGGCCG CCGCTTGCCC
GACCATATTC ATGAACGACT TGGCACGATT AGCACTGTAC GTTTTGTTCC ATCGCAATAT
ATTGGGCCAT ACTTGTCGTT TCTTTTCAAT GGATCATTAC TCACGGTGTT TTATAATAGC
AGCACCACAC CAGCTGAAGG CGATGAGCAA ACTGAACGCA CGCAAAGCCT GTATCAGCCA
TTAGCAGCCT TGGCCGATAA AACGCGCTTG CAAATTATGA CGTTGTTGCA TGGCCGCGAA
TTGTATGCCC AAGAAATTGT CAATTTGCTC GATATTCATC AATCGGCGGT TTCACGCCAT
TTGAAGCTGA TGGAAACTTC AGGTGTGCTG AATGTTCGCC GCGACAAGGG TGCAAAATAT
TATTCGATCA ATCGCCAACG GATTGAAGAA ATTTCGGCTC GCCTACGCGA ATTTGTCTAA
 
Protein sequence
MTELVRAQPA FKVDFVPSLG LDLLSTMGLI GIVHDFEGLD AWLVEAAASV PPRLRHDIQL 
AMRMGVYPYV VVETVSDQIL QPGAAGHDDF NGLIEDLKAL SPQECAAMVH KIVQRTAANA
DVELLHTPAE IIADQEQLEE LLAKMQFPVD TDELIELLQQ PTEWRDLLVS TIQRFWDRIY
REQYELQQAR RERNAHYHRT HQYSVNFRDL FAGVTGRRLP DHIHERLGTI STVRFVPSQY
IGPYLSFLFN GSLLTVFYNS STTPAEGDEQ TERTQSLYQP LAALADKTRL QIMTLLHGRE
LYAQEIVNLL DIHQSAVSRH LKLMETSGVL NVRRDKGAKY YSINRQRIEE ISARLREFV