Gene EcolC_2096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2096 
Symbol 
ID6067285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2292382 
End bp2293407 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content56% 
IMG OID641601504 
Productmajor capsid protein E 
Protein accessionYP_001725063 
Protein GI170020109 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000203332 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATGT ACACAACCGC CCAGCTGCTG GCGGCAAATG AGCAGAAATT TAAGTTTGAT 
CCGCTGTTTC TGCGTCTCTT TTTCCGTGAG AGCTATCCCT TCACTACGGA GAAAGTCTAT
CTCTCACAAA TTCCGGGACT GGTAAACATG GCGCTGTACG TTTCGCCGAT TGTTTCCGGT
GAGGTTATCC GTTCCCGTGG CGGCTCCACC TCTGAATTTA CGCCGGGATA TGTCAAACCC
AAGCATGAGG TGAATCCGCA GATGACCCTG CGTCGCCTGC CGGATGAAGA TCCACAGAAT
CTGGCGGACC CGGCTTACCG CCGCCGTCGC ATCATCATGC AGAACATGCG AGACGAAGAG
CTGGCCATTG CTCAGGTCGA AGAGATGCAG GCAGTTTCTG CCGTGCTCAA GGGCAAATAC
ACCATGACCG GTGAAGCCTT CGATCCGGTT GAGGTGGATA TGGGCCGCAG TGCGGCGAAC
AACATCACGC AGTCCGGCGG CACGGAGTGG AGCAAGCGTG ACAAGTCCAC GTATGACCCG
ACCGACGATA TCGAAGCCTA CGCGCTGAAC GCCAGCGGCG TGGTGAATAT CATCGTGTTT
GATCCGAAAG GCTGGGCGCT GTTCCGTTCC TTCAAAGCCG TCAAGGAGAA GCTGGATACC
CGTCGCGGCT CTAATTCCGA GCTGGAGACA GCGGTAAAAG ACCTGGGCGA AGCGGTGTCC
TATAAGGGGA TGTATGGCGA TACGGCGATC GTCGTGTATT CCGGACAGTA CGTGGAAAAC
GACGTCAAAA AGAACTTCCT GCCGGACAAC ACGATGGTGC TGGGGAACAC TCAGGCACGC
GGTCTGCGCA CCTATGGCTG CATTCAGGAT GCGGACGCAC AGCGCGAAGG TATTAACGCC
TCTGCCCGCT ACCCGAAAAA CTGGGTGACC ACCGGCGATC CGGCGCGTGA GTTCACCATG
ATTCAGTCAG CACCGCTGAT GCTGCTGGCT GATCCTGATG CGTTCGTGTC CGTACAACTG
GCGTAA
 
Protein sequence
MSMYTTAQLL AANEQKFKFD PLFLRLFFRE SYPFTTEKVY LSQIPGLVNM ALYVSPIVSG 
EVIRSRGGST SEFTPGYVKP KHEVNPQMTL RRLPDEDPQN LADPAYRRRR IIMQNMRDEE
LAIAQVEEMQ AVSAVLKGKY TMTGEAFDPV EVDMGRSAAN NITQSGGTEW SKRDKSTYDP
TDDIEAYALN ASGVVNIIVF DPKGWALFRS FKAVKEKLDT RRGSNSELET AVKDLGEAVS
YKGMYGDTAI VVYSGQYVEN DVKKNFLPDN TMVLGNTQAR GLRTYGCIQD ADAQREGINA
SARYPKNWVT TGDPAREFTM IQSAPLMLLA DPDAFVSVQL A