Gene Information |
Locus tag | EcolC_2096 |
Symbol | |
ID | 6067285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2292382 |
End bp | 2293407 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641601504 |
Product | major capsid protein E |
Protein accession | YP_001725063 |
Protein GI | 170020109 |
COG category | |
COG ID | |
TIGRFAM ID | |
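
As a quick consistency check on the fields above, the gene and protein lengths can be recomputed from the start and end coordinates. This is a minimal sketch, not part of the database record. It assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical helpers introduced only for illustration.

```python
# Coordinates copied from the record above.
start_bp = 2292382
end_bp = 2293407


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the results agree with the Gene Length and Protein Length fields listed in the record.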
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000203332 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATGT ACACAACCGC CCAGCTGCTG GCGGCAAATG AGCAGAAATT TAAGTTTGAT CCGCTGTTTC TGCGTCTCTT TTTCCGTGAG AGCTATCCCT TCACTACGGA GAAAGTCTAT CTCTCACAAA TTCCGGGACT GGTAAACATG GCGCTGTACG TTTCGCCGAT TGTTTCCGGT GAGGTTATCC GTTCCCGTGG CGGCTCCACC TCTGAATTTA CGCCGGGATA TGTCAAACCC AAGCATGAGG TGAATCCGCA GATGACCCTG CGTCGCCTGC CGGATGAAGA TCCACAGAAT CTGGCGGACC CGGCTTACCG CCGCCGTCGC ATCATCATGC AGAACATGCG AGACGAAGAG CTGGCCATTG CTCAGGTCGA AGAGATGCAG GCAGTTTCTG CCGTGCTCAA GGGCAAATAC ACCATGACCG GTGAAGCCTT CGATCCGGTT GAGGTGGATA TGGGCCGCAG TGCGGCGAAC AACATCACGC AGTCCGGCGG CACGGAGTGG AGCAAGCGTG ACAAGTCCAC GTATGACCCG ACCGACGATA TCGAAGCCTA CGCGCTGAAC GCCAGCGGCG TGGTGAATAT CATCGTGTTT GATCCGAAAG GCTGGGCGCT GTTCCGTTCC TTCAAAGCCG TCAAGGAGAA GCTGGATACC CGTCGCGGCT CTAATTCCGA GCTGGAGACA GCGGTAAAAG ACCTGGGCGA AGCGGTGTCC TATAAGGGGA TGTATGGCGA TACGGCGATC GTCGTGTATT CCGGACAGTA CGTGGAAAAC GACGTCAAAA AGAACTTCCT GCCGGACAAC ACGATGGTGC TGGGGAACAC TCAGGCACGC GGTCTGCGCA CCTATGGCTG CATTCAGGAT GCGGACGCAC AGCGCGAAGG TATTAACGCC TCTGCCCGCT ACCCGAAAAA CTGGGTGACC ACCGGCGATC CGGCGCGTGA GTTCACCATG ATTCAGTCAG CACCGCTGAT GCTGCTGGCT GATCCTGATG CGTTCGTGTC CGTACAACTG GCGTAA
|
Protein sequence | MSMYTTAQLL AANEQKFKFD PLFLRLFFRE SYPFTTEKVY LSQIPGLVNM ALYVSPIVSG EVIRSRGGST SEFTPGYVKP KHEVNPQMTL RRLPDEDPQN LADPAYRRRR IIMQNMRDEE LAIAQVEEMQ AVSAVLKGKY TMTGEAFDPV EVDMGRSAAN NITQSGGTEW SKRDKSTYDP TDDIEAYALN ASGVVNIIVF DPKGWALFRS FKAVKEKLDT RRGSNSELET AVKDLGEAVS YKGMYGDTAI VVYSGQYVEN DVKKNFLPDN TMVLGNTQAR GLRTYGCIQD ADAQREGINA SARYPKNWVT TGDPAREFTM IQSAPLMLLA DPDAFVSVQL A
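
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any computation such as GC content. The following is a minimal sketch of that step. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, for illustration only.
example = "ATGTCGATGT ACACAACCGC"
cleaned = clean_sequence(example)
example_gc = gc_percent(cleaned)
print(cleaned, example_gc)
```

Passing the full Gene sequence field through the same two functions would give a value to compare against the GC content field.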