Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3176 |
Symbol | |
ID | 6065747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3481272 |
End bp | 3482483 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641602592 |
Product | putative capsular polysaccharide bisynthesis protein |
Protein accession | YP_001726126 |
Protein GI | 170021172 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTAT TGAAGAGTGC TTCCACTATT GCCGGGTCGT CAGTCATTTC GCAACTGATT GGCGCCTTCT CAATCTGGTT AATCTCGTAT AAATACGATC TCGCGGAAGT GGGCCTGTAT GCGCTCAATT ACAGTATCGC TGTGATCGGA GCGCAGGTAT GTACCTTTGC GTCCCAATTG CTTATTCCTA AACAATCGGA AGAAGAGTTA ACGCAAAATG TGGTGTTTTG CCTTCTGCAA AGCGCGATTC TGGCGCTGCC TTACGCGCTA CTGACGGCGT GGCTATTCCA TCAAAACGTG CTGTTTCTCT ATCTGTTATC GCTGTCGAAC GCATGGGTTC TGATATCGGA AAACCTGTCA CTGCGCACGG GTAATTTCCG GTTTCTCATC TTCCAGCGTA TTTCGGTGTC AGTCGTCGTG GTGCTGTCTG TTGTACTGAC GAATCAGGTT CAGATGTTTT ACTGGACCTG GGCCAGCGGC ATGATGCTGC TAACCATCAG TTGTATCGCG CGATCTTTTA ATTTCCGTAC GGTCACGGCG CAGCATCTGT CCCTGAAAAG CAACCTGGTG TTTTTCAGGA CGCATTTTCA CCATATTTCG CGAGTCGGGA GTGCCGAAGT ATTGGCGATG GCTAACAATA ATCTGCCGAT CATGCTCATA AACTTTTGGT TTTCGGCATT AACGGCGGGC TATTTTTCCG TGGTCAGTCG CTTCTGTCTG TCACCGGTGA CCATTGTCGG GAATGCGGTG CGTAATACCA TTTTCTCAAA GTGGTCGATC GACTTCAGAA ATAACACGTT TAACTATCCG GAATACCAGC GGGTTCGTTT CCTGCTGATG GTGCTCGGGG TGATATGTAC CCTGGGGGTG TTTATTTTTT ATCCTATTGT GATGCATCTT GGTTTCGGTG AGGACTGGAT CAATTCCATT GATACCTCGC GTTATATGCT GCCCTATCTA TTCCCGGCGC TGGCGGTCAG TCCTCTCACC GTCATCGAAC TGATTTTTGG ATCACACCGA TATTTCCTGC GTATTCAGCT GGAACAACTG GCAATCGTAC TGATCGCTTT TGTAGTGACG CCTTATTTCT ATAACGACTA TGCCACCTCG GTGATTATTT TCTCCGTTCT GACGTTTATT CGTTATGCAT TCATTTATCT GGCGATGAAT AAACGTGCGA CGCTTTTGAA GAACAACCCG GTGATGCCAT GA
|
Protein sequence | MSLLKSASTI AGSSVISQLI GAFSIWLISY KYDLAEVGLY ALNYSIAVIG AQVCTFASQL LIPKQSEEEL TQNVVFCLLQ SAILALPYAL LTAWLFHQNV LFLYLLSLSN AWVLISENLS LRTGNFRFLI FQRISVSVVV VLSVVLTNQV QMFYWTWASG MMLLTISCIA RSFNFRTVTA QHLSLKSNLV FFRTHFHHIS RVGSAEVLAM ANNNLPIMLI NFWFSALTAG YFSVVSRFCL SPVTIVGNAV RNTIFSKWSI DFRNNTFNYP EYQRVRFLLM VLGVICTLGV FIFYPIVMHL GFGEDWINSI DTSRYMLPYL FPALAVSPLT VIELIFGSHR YFLRIQLEQL AIVLIAFVVT PYFYNDYATS VIIFSVLTFI RYAFIYLAMN KRATLLKNNP VMP
|
| |