Gene Haur_4797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4797 
Symbol 
ID5736641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6114581 
End bp6115591 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content52% 
IMG OID641281962 
ProductLacI family transcription regulator 
Protein accessionYP_001547556 
Protein GI159901309 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.184791 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATATA CGATTAAAGA TGTGGCGAAA CGGGCGGGAG TTGGCATTGC AACCGTTTCG 
CGGGTGCTCA ACGAATCGCC CAACGTGCTA CCAGAAACTC GTGCCAGAGT TTTGGCAGTG
ATCGACGAGC TTGGTTATCG ACCAAATCAT GCTGCTCGCC AACTGGTAAC CGCCAAAACC
AATGCGATTG CGATTATTCT GCCCTTCCTT ACGCGACCAT TTTTTATTGA AGTGCTGCGG
GGCATCGAAG CGGTTGTGGC CAATTCTGAA TATCAGTTGA TTATTTTTAA TGTTGACTCG
CCGGAACAAC GCACGCGCTA TTTTAATACG CTGCCATTTT TGGGGCGTAC CGATGGGCTG
TTGATTGTTT CGTTGCCTTT GGCTCAGCCT GAAATCAAAC GCTTGCAAGC GGCCAATTTG
CCAGCAGTGA TGATCGACAC TCAAGTTGCC AATTTACCAT CAGTGGTCGT TGATAATGTG
GGCGGTGCAT TCAAAGCCGT CGAACATCTA ATCAGCCAAG GCCATCAGCG GATTGGCTTT
GTTTCGGGTC AGTTGGAACC AGATTTGGGT TTTACCGTCA ACCGCGATCG GCGGCGAGGC
TACGAGGCTG CTCTCACTGC GCATCATCTG CCATTGCAAC CTGAATATCT GCGGCCAGGC
TTTGATCGGC GCGATTGGGG CCATCAGGCG GCGCTTGAAT TGCTGGCATT ACCTGAGTCA
CCGAGCGCTA TTTTTGCTGC CAACGACGAT TTAGCCTTTG GCGTGATCGA TGCTTTGCGC
GAACGCGGCT TGAAGGCAGG CGAAGACATC GCCGTGGTTG GCTATGATGA TCTCGAAATG
GCGCAGTTGG TGGGTTTAAC CACAATTCAT CAGCCAATGG AGCAAATGGG CCGTAAGGGA
GCTGAGGTTT TGCTGGCCGC GCTGAATGAA GGCACACGCC GCCCAACCCT CTATACCTTG
CCCGTTAATC TGATCGAACG CGCCAGCAGC AGCAAACTCA GCCAAGCATA A
 
Protein sequence
MAYTIKDVAK RAGVGIATVS RVLNESPNVL PETRARVLAV IDELGYRPNH AARQLVTAKT 
NAIAIILPFL TRPFFIEVLR GIEAVVANSE YQLIIFNVDS PEQRTRYFNT LPFLGRTDGL
LIVSLPLAQP EIKRLQAANL PAVMIDTQVA NLPSVVVDNV GGAFKAVEHL ISQGHQRIGF
VSGQLEPDLG FTVNRDRRRG YEAALTAHHL PLQPEYLRPG FDRRDWGHQA ALELLALPES
PSAIFAANDD LAFGVIDALR ERGLKAGEDI AVVGYDDLEM AQLVGLTTIH QPMEQMGRKG
AEVLLAALNE GTRRPTLYTL PVNLIERASS SKLSQA