Gene EcolC_2761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2761 
Symbol 
ID6064783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3032686 
End bp3034107 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content53% 
IMG OID641602167 
ProductPhage-related tail fibre protein-like protein 
Protein accessionYP_001725716 
Protein GI170020762 
COG category[R] General function prediction only 
COG ID[COG5301] Phage-related tail fibre protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000146367 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCACAA AATTTTATAC CCTGCTGACG GATATTGGCG CGGCGAAACT TGCCAGCGCC 
GCCGCGCTCG GTGTGCCGCT AAAAATTACC CATATGGCGG TGGGCGATGG CGGTGGAGTA
TTGCCAACGC CGGACGCAAA GCAGACGGCA CTGGTAAATG AGAAACGCCG GGCTGCGCTG
AATATGCTTT ATATCGACCC GCAGAACAGC AGCCAGATTA TTGCCGAACA GGTGATCCCT
GAAAACGAGG GCGGTTGGTG GATACGTGAA GTGGGCTTGT TTGATGAGTC CGGGGCATTG
ATTGCCGTGG GCAACTGCCC GGAAAGCTAT AAGCCGCAAC TGGCTGAAGG TAGCGGGCGC
ACTCAGACCG TGCGCATGGT GCTGATTACC AGCAGCACGG ACAATATCAC CCTGAAAATC
GACCCTGCTG TAGTGCTGGC AACCCGCAAG TATGTGGATG ACAAGGCACT GGAGCTGAAG
GTGTACGCGG ATGATCAGAT GGCAAAACAT CTTGCCGCAC CGGACCCGCA TTCACAGTAC
GCGCCAAAAG CCAGCCCGAC ATTTACCGGA ACCCCCAAAG CGCCAACGCC AGCGGCGGGG
AATAATACCA CGCAGGTTGC GACCACTGCG TTTGTACAGG CGGCACTGAC GGCCCTTATT
AATGGTGCGC CAGCCACGCT GGACACGCTG AAAGAAATAG CCGCAGCCAT TAACAATGAT
CCGAATTTCA GTACCACCAT TAACAATGCG CTGGCACTAA AAGCACCGTT GTCGAGTCCG
GCACTCACCG GAACGCCAAC AGCCCCCACG GCGGCGCAGT CGGTCAACAA TACACAGATT
GCCACCACGG CATTTGTGAA ATCGGCGATT GCGGCAATGG TGGGTTCTGC ACCTGCGGCA
CTGGATACAC TGAACGAACT GGCGGCGGCA CTGGGGAATG ATCCGAACTT TGCCACGACA
ATGCTTAATG CGCTGTCAGG TAAACAACCG CTGGACAATA CGCTTACCAA TTTGAGTGGA
AAGGATGTAG CTGGTCTTCT CACATACCTT GGTTTGGGAG AGGCGGCAAA ACGGGATGTG
GGCACAGGGG AAAATCAGAT ACCGGACATG GTTTCATTTA GTGGGGTGAG GGATTATTAC
GGAAAACAAC TTTTGCCAGG AGGGTTGATA CTCCAGTGGC TGACGATTCC ATCAAGTGCA
GCAGCCAAAG CTGTAACACT GAATAATGGT AATTATCAGC TGTCAGGCTA TAAATGGCCC
CAGTCATTTG GTGTCCTGTT TGCTGTGTTT GCTACAAAAG TTTCTGGCTC GACTAACGAA
GCATACGCAA TCTCAGTTAA TCGTCACTCT ACCGATGTAA TTGTCACCTG GAATGCCCGT
AAGGCTGATG ACGTCCACAT TTTAGGAATT GGGAAATTAT GA
 
Protein sequence
MSTKFYTLLT DIGAAKLASA AALGVPLKIT HMAVGDGGGV LPTPDAKQTA LVNEKRRAAL 
NMLYIDPQNS SQIIAEQVIP ENEGGWWIRE VGLFDESGAL IAVGNCPESY KPQLAEGSGR
TQTVRMVLIT SSTDNITLKI DPAVVLATRK YVDDKALELK VYADDQMAKH LAAPDPHSQY
APKASPTFTG TPKAPTPAAG NNTTQVATTA FVQAALTALI NGAPATLDTL KEIAAAINND
PNFSTTINNA LALKAPLSSP ALTGTPTAPT AAQSVNNTQI ATTAFVKSAI AAMVGSAPAA
LDTLNELAAA LGNDPNFATT MLNALSGKQP LDNTLTNLSG KDVAGLLTYL GLGEAAKRDV
GTGENQIPDM VSFSGVRDYY GKQLLPGGLI LQWLTIPSSA AAKAVTLNNG NYQLSGYKWP
QSFGVLFAVF ATKVSGSTNE AYAISVNRHS TDVIVTWNAR KADDVHILGI GKL