Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3173 |
Symbol | |
ID | 6066568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3477295 |
End bp | 3478581 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641602589 |
Product | putative capsular polysaccharide biosynthesis protein |
Protein accession | YP_001726123 |
Protein GI | 170021169 |
COG category | [S] Function unknown |
COG ID | [COG5338] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTATTA AACTTTTGGC CATTACCGGA ATTGTCATTC TACAACCGGT ATGGGCCGAA CCCATCCCGA AATCACATAC AGGCATTGCG GGGGTAGATT TTCAAAGCGA CGTTGCAGCG AATTATGGTT ATTCAGATAA CATCACCTTT CAACCACACA GCAGCAAGGA GAAAGATTCT GTATTTCAGA GCATTACGCC GACGTTAAGT ATGGTAGGCG AGCGTTTTCA GGATAAATAT TTGCTGATGT ATTCCGGTGA CTACCGTCGT TACGATAAGG ACTCTGCCGA CAACTACAAC GATCATTTTT TCCGTTTTAA CGGCGCATGG CGATACGGGC TGAAACACGG ACTGACCCTG AATCTGGAAG ATTCTCTGGG GCATGAGGAG CGCGGGCGCG GAATTACAGA AGGTTTTGTC CAGGAACAAT TTAGACAGTT CGGTATTACG TCTCCTTTGA GCCCCCATTT TGTGAATAGC GAATTGCGTT ACAGTTATGG TGCACCTGAA GGACGCGGCA AGGCTGAGTT GGCATTACAG TATAAAAAGT TACGTTTCGG TAAGACGGGC GGGGTCCGTA ATGCTGATGT AGACTTTTAC AATTATCTCC TTGAGCAGGA GTGGTACGAA AATAGTCTTA TTGCTGAAGT TTACGACCAA TATACCGCGA ACACACGTTT TCGTTATAGT TTTATTACTA ACCAGCGTCG ATACGATCGT TTATCTGAAA AAGATAGCAA CGAGTACTAT TTGCGCTATG GAGTAAAGTC CCAATTATCG GACAAAACAA ACATCGACAT GAATGTCGCG TGGTTGTATA AGACGTTTGA GAATAATGCC AACGCTCGAA ATTTTAATGG ATTGAATTGG GACATACAGG GAGAGTGGAA GCCGCTAAAA CAGTCTGTTG TTACGCTACA TACTTCACAG AATATTAAAG ATCCGTCCGA GGTGGGTGGA TATATTTTAT TCACCAAATA TGGTGTTTCG TACCAACACT TTTGGTTGGG GGATCGTTTT TCTACTACCC TGGATTATTC TCTTAGCCGG GAGGATTACA AAAAACAGGA CAAAAATCGT CGTGACAGGA ATGGCGTGTT TACGATGAAA ATGAGTTACG ACTATACCCC TTCGGTTAAC GTTGAACTCA AGTATCTTCT GAATAAGTTG GATTCGAATA AGAATACGGA TTCGTTCTAC ATTGGACCTA ACGACGAGCG GGAAGTAACA AGGACGTTAG GTTATGACAA TTCAATGATT ATGCTTACTG CTAAGGTTCA GATATAA
|
Protein sequence | MRIKLLAITG IVILQPVWAE PIPKSHTGIA GVDFQSDVAA NYGYSDNITF QPHSSKEKDS VFQSITPTLS MVGERFQDKY LLMYSGDYRR YDKDSADNYN DHFFRFNGAW RYGLKHGLTL NLEDSLGHEE RGRGITEGFV QEQFRQFGIT SPLSPHFVNS ELRYSYGAPE GRGKAELALQ YKKLRFGKTG GVRNADVDFY NYLLEQEWYE NSLIAEVYDQ YTANTRFRYS FITNQRRYDR LSEKDSNEYY LRYGVKSQLS DKTNIDMNVA WLYKTFENNA NARNFNGLNW DIQGEWKPLK QSVVTLHTSQ NIKDPSEVGG YILFTKYGVS YQHFWLGDRF STTLDYSLSR EDYKKQDKNR RDRNGVFTMK MSYDYTPSVN VELKYLLNKL DSNKNTDSFY IGPNDEREVT RTLGYDNSMI MLTAKVQI
|
| |