Gene Haur_2121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2121 
Symbol 
ID5734009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2663844 
End bp2664845 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content56% 
IMG OID641279262 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_001544889 
Protein GI159898642 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.173687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCCC TTGCACTCTG CCGCTGTGCT ATCGCACTGG CCATAGCCTG GACTCTGGCA 
GCGTGCGGCG GCTCCTCGTC TGGAGTCGGC GCAGTAGCGC CCTCCTCCCT CGATCTCAAG
ACCAAAACTT ATCCTGAGCT GGTTGTCGGG TTTGCTCAAA TCGGAGCCGA AAGCGAGTGG
CGCACCGCTA ACACCCGCTC GATTCAAGAT ACGGCGAATC AACTTGGTGT CGAATTGGCG
CTTTCCGATG CGCAACAGCA GCAGGAAAAT CAGATCAAGG CCATCCGTTC GTTTATTGCT
CAAGGTGTCG ATGTCATCGG AGTCTCGCCC GTGGTCGAGA CTGGCTGGGA CGAGGTTTTC
GCCGAGGTCA AGCAGGCTGG AATTCCGTTG ATCTTGCTCG ATCGCAACGC CAATGTGCCA
GATGATCTCT ATAGTGTCCG CATCGGATCA GACTTCGTGG AAGAGGGTCG GCGGGCCTGC
GGTGAGATGG CTCGACTGCT GGATGGTCAA GGTGCGATTG TCGTCTTAGA AGGCACCCAA
GGCTCAGCCC CAATGATCGG ACGAGGTACT GGCTTTCAAG AATGCCTGCA ATCTTATCCC
GCGCTTCACA TAATCGACAG CCAGTCCGGT GATTTCATTC GCGCCCGTGG CAAAGAGGAG
ATGGCAGCGT TGCTGCAAAA ACACGGCAAC AGCATCGACG GCGTGTTTGC CCAGAACGAT
GACATGGCGC TTGGCGCGAT CGAGGCCATC GAGGAGTATG GGCTACGGCC TGGCGTTGAT
ATCAAAATTG TTTCGATCGA TGCGGTACGG GCTGCCTTCG AAGCAATGAT TGACGGCAAG
CTTAACGCCA CAATCGAGTG TAACCCGCTG CTTGGCCCGC TGTTTTTCGC CACAGCCCTG
AACTTGGCTA ACGGCATACC GGTTGAAAAA TGGATCAAGC CCGACGAGGG CATCTACCGA
CAGGATACCG CCGCGCAGGA ATTGTCTAAG CGCGAATACT AG
 
Protein sequence
MNALALCRCA IALAIAWTLA ACGGSSSGVG AVAPSSLDLK TKTYPELVVG FAQIGAESEW 
RTANTRSIQD TANQLGVELA LSDAQQQQEN QIKAIRSFIA QGVDVIGVSP VVETGWDEVF
AEVKQAGIPL ILLDRNANVP DDLYSVRIGS DFVEEGRRAC GEMARLLDGQ GAIVVLEGTQ
GSAPMIGRGT GFQECLQSYP ALHIIDSQSG DFIRARGKEE MAALLQKHGN SIDGVFAQND
DMALGAIEAI EEYGLRPGVD IKIVSIDAVR AAFEAMIDGK LNATIECNPL LGPLFFATAL
NLANGIPVEK WIKPDEGIYR QDTAAQELSK REY