Gene EcolC_1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1971 
Symbol 
ID6068221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2177822 
End bp2178847 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content54% 
IMG OID641601385 
ProductDNA-binding transcriptional repressor PurR 
Protein accessionYP_001724944 
Protein GI170019990 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000994316 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAACAA TAAAAGATGT AGCGAAACGA GCAAACGTTT CCACTACAAC TGTGTCACAC 
GTGATCAACA AAACACGTTT CGTCGCTGAA GAAACGCGCA ACGCCGTGTG GGCAGCGATC
AAAGAATTAC ACTACTCCCC TAGCGCGGTG GCGCGTAGCC TGAAGGTTAA CCACACCAAG
TCTATCGGTT TGCTGGCGAC CAGCAGCGAA GCGGCCTATT TTGCCGAGAT CATTGAAGCA
GTTGAAAAAA ATTGCTTCCA GAAAGGTTAC ACCCTGATTC TGGGTAATGC GTGGAACAAT
CTTGAGAAAC AGCGGGCTTA TCTGTCGATG ATGGCGCAAA AACGCGTCGA TGGTCTGCTG
GTGATGTGTT CTGAGTACCC AGAGCCGTTG CTGGCGATGC TGGAAGAGTA TCGCCATATC
CCAATGGTGG TGATGGACTG GGGTGAAGCA AAAGCTGACT TCACCGATGC GGTCATTGAT
AACGCGTTCG AAGGCGGCTA CATGGCCGGG CGTTATCTGA TTGAACGCGG TCACCGCGAA
ATCGGCGTCA TCCCCGGCCC GCTGGAACGT AACACCGGCG CAGGCCGCCT TGCCGGTTTT
ATGAAGGCGA TGGAAGAAGC GATGATCAAG GTGCCGGAAA GCTGGATTGT TCAGGGTGAC
TTTGAACCCG AATCTGGTTA TCGCGCCATG CAGCAAATAC TGTCGCAGCC GCATCGCCCT
ACTGCCGTCT TCTGTGGTGG CGATATCATG GCAATGGGCG CACTTTGTGC TGCTGATGAA
ATGGGTCTGC GCGTCCCGCA GGATGTTTCG CTGATCGGTT ATGATAACGT GCGCAACGCC
CGCTATTTTA CGCCGGCGCT AACCACGATC CACCAGCCAA AAGATTCGCT GGGTGAAACA
GCGTTCAACA TGCTGTTGGA TCGTATCGTC AACAAACGTG AAGAACCGCA GTCCATTGAA
GTGCATCCGC GCTTGATTGA ACGCCGCTCC GTGGCTGACG GCCCGTTCCG CGACTATCGT
CGTTAA
 
Protein sequence
MATIKDVAKR ANVSTTTVSH VINKTRFVAE ETRNAVWAAI KELHYSPSAV ARSLKVNHTK 
SIGLLATSSE AAYFAEIIEA VEKNCFQKGY TLILGNAWNN LEKQRAYLSM MAQKRVDGLL
VMCSEYPEPL LAMLEEYRHI PMVVMDWGEA KADFTDAVID NAFEGGYMAG RYLIERGHRE
IGVIPGPLER NTGAGRLAGF MKAMEEAMIK VPESWIVQGD FEPESGYRAM QQILSQPHRP
TAVFCGGDIM AMGALCAADE MGLRVPQDVS LIGYDNVRNA RYFTPALTTI HQPKDSLGET
AFNMLLDRIV NKREEPQSIE VHPRLIERRS VADGPFRDYR R