Gene Haur_0892 details


Gene Information

Locus tag: Haur_0892
Symbol:
ID: 5732793
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1019600
End bp: 1020640
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 51%
IMG OID: 641278024
Product: LacI family transcription regulator
Protein accession: YP_001543668
Protein GI: 159897421
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
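
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Both are assumptions rather than statements from the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1019600, 1020640)
print(length_bp)                  # 1041
print(protein_length(length_bp))  # 346
```

Under those assumptions, both results agree with the Gene Length and Protein Length values in the record.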


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00803151
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACAGC GGATCACGAT GGAAGACATT GCGCGGCAAA GCGGTGTCTC GTTGGCAACA 
GTTTCATTAG TATTACGCGA CAAGCCTGGG ATTAACGACG AGACACGCCG CCGCGTGTTG
GATATTGCCC GTGATCTTGG TTATCGCAAG CGCTTGAATC ATGAGAAGTT GGTTTCGCAA
TCGTTGCACA ACGCAGGCGT AATTGTTAAG GCCTCAATTG GCGACGATAG CCCACTGACC
AACCCGTTTT ATGCCCCGAT TGTCGCAGGT ATCGAGGCCG CCTGTCGCAA AATGCATATT
AACTTAATGT ATGCCACTGT GCCAGTTGAT ATGGATAATC ATCCTCAAGA GATGCCCCGT
TTGCTCTCGG AAGATCACCT TGATGGGGTA TTGTTAGTTG GCGCATTCGC CGATGCAACC
ATCACCAAGC TTTTGCAACG TGAGGGCATT CCGGCGGTTT TGGTCGATGG CTACTCACAC
GAACATGTCT ACGATTCAGT TGTTTCAGAT AACTTCCGCG CAGCCTATGA AGCAGTCAGC
TATCTGATTA GCTACGGGCA TCGTCATATT GGCCTGATTG GGACAACCAA AGAGGCCTAC
CCTAGCATTG CCGAACGGCG CAAAGGCTAT ATTCAAGCCT TAACCGATCA TGGCATTCAT
GATCAGTATT TTGGTGATTG CTTGCTCACA ATGCACGAAG GCAGCGATAC ATCGAGCATT
CTGTTGCAGC GCCATCCGCA AATTACCGCG CTGTTTTGTG CCAACGATAT GATGGCGATT
GGTGCAACCC AAGCCGCGCG GGCGTTGCAT CGCCAAATTC CCCAAGATTT ATCAATTATT
GGTTTCGATA ATATTGATCT GGCTCAGCAT GTTGCGCCAG CGCTTACCAC AATGCATGTC
GATAAAGTCA GCATGGGGCG CTTTGCGGTG CAATTGTTAG CCAATCGAGC CGAATACCCA
GACCAGGCTC CGGCGACAGT CTCGCTGCGG CCACGGCTGC TTGAACGCCA ATCAGTTCAA
CGTTTGCAAC CACCAAAGTA G
 
Protein sequence
MTQRITMEDI ARQSGVSLAT VSLVLRDKPG INDETRRRVL DIARDLGYRK RLNHEKLVSQ 
SLHNAGVIVK ASIGDDSPLT NPFYAPIVAG IEAACRKMHI NLMYATVPVD MDNHPQEMPR
LLSEDHLDGV LLVGAFADAT ITKLLQREGI PAVLVDGYSH EHVYDSVVSD NFRAAYEAVS
YLISYGHRHI GLIGTTKEAY PSIAERRKGY IQALTDHGIH DQYFGDCLLT MHEGSDTSSI
LLQRHPQITA LFCANDMMAI GATQAARALH RQIPQDLSII GFDNIDLAQH VAPALTTMHV
DKVSMGRFAV QLLANRAEYP DQAPATVSLR PRLLERQSVQ RLQPPK
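
As a rough sketch, the relationship between the gene and protein sequences above can be checked by translating the nucleotide sequence and computing its GC content. The example assumes the whitespace-grouped layout shown here. It uses the standard codon assignments, which NCBI translation table 11 shares (table 11 differs from the standard table only in its permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are illustrative and do not come from any library.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence as displayed in the record.
print(translate("ATGACACAGC GGATCACGAT GGAAGACATT"))  # MTQRITMEDI
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the listed protein sequence and GC content.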