Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3961 |
Symbol | |
ID | 6064488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4351067 |
End bp | 4352068 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641603374 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001726889 |
Protein GI | 170021935 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0697467 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAATAA TAAATGTGAC GATAGAATTA CGAGAAGAGT TTACCTTAGT AAAGAAAAGG AAGTATCCAT TAAGGTGGTA TATTATTATA AATAATGTCG CCGTCCATAA TAATACTATC GAAATTCCAC AGACAGTAAA TGGTGGTTAC GATTTTTCAC ACCTTAGCCT GAAAGGTATC GTGATTAAAG ATGAAGATTT ATCCAATTCG AATTTTGCAG GTTGCAGACT ACAAAACGCT ATTTTCCAGG ACTGTAATAT GTATAAAACG AATTTTTATT ACGCCATAAT GGAAAAAATA CTTTTTGATA ATTGTATTCT CGATGACTCA AATTTCGCTC AGATAAAAAT GGCCGACGGA ACTCTAAATG CATGCTCCGC TATGCATGTT CAATTCTACA ATGCAGCAAT GAATAGAGCC AATATTAAAA ATACCTTTCT TGACTATTCA AATTTTTATA TGGCGTACAT GGCTGAGGTA AATCTTTATA AAGTAATAGC GCCATATGTT AATTTATTTA AAGCCGACCT TAGTTTCTCT AAACTCGATT TAATTAACTT TGAACATGCT GATCTGTCTC GCGTCAATCT GAACAAAGCA ATCCTCCAGA ATATAAACTT AATTGATAGC AAACTCTTTT GTACGTGGCT AACAAATACA TTCCTCGAAA TGGTTATATG TACCGACTCT AATATGGCTA ATGTTAATTT TAATAATGCC AATTTAAGCA ATTGCCATTT CAACTGTTCT GTTTTAACAA AAGCCTGGAT GTTTAATATC CGTCTCTATC GTGTTAATTT CGATGAGGCT AGCGTCCAGG GAATGGGTAT TACCATTCTC CGTGGTGAGG AAAATATCTC CATTAATAGT GATACCCTGG TAACACTACA GAAATTCTTT GAAGAAGATT GTACCTCTCA TACTGGCATG TCACAAACTG AGGATAATAT TAATGCAGTC GCTATGAAGA TTACTGCAGA TATTATGCAA CACGCAGATT GA
|
Protein sequence | MQIINVTIEL REEFTLVKKR KYPLRWYIII NNVAVHNNTI EIPQTVNGGY DFSHLSLKGI VIKDEDLSNS NFAGCRLQNA IFQDCNMYKT NFYYAIMEKI LFDNCILDDS NFAQIKMADG TLNACSAMHV QFYNAAMNRA NIKNTFLDYS NFYMAYMAEV NLYKVIAPYV NLFKADLSFS KLDLINFEHA DLSRVNLNKA ILQNINLIDS KLFCTWLTNT FLEMVICTDS NMANVNFNNA NLSNCHFNCS VLTKAWMFNI RLYRVNFDEA SVQGMGITIL RGEENISINS DTLVTLQKFF EEDCTSHTGM SQTEDNINAV AMKITADIMQ HAD
|
| |