Gene EcolC_2756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2756 
Symbol 
ID6065615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3029254 
End bp3030426 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content57% 
IMG OID641602162 
Producttail sheath protein 
Protein accessionYP_001725711 
Protein GI170020757 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000512477 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTCAGG ATTACCACCA CGGAGTGCGC GTTGTTGAAG TCAACGAAGG CACCCGATCC 
ATTACCACGG TGAGCACCGC CATCGTGGGT ATGGTCTGCA CGGGCGATGA TGCCGATGCA
AAAATGTTTC CTCTTAATAA ACCCGTGCTG ATCACTGATG TGCTGACTGC CAGCGGTAAA
GCGGGTGAGT CCGGCACGCT GGCCCGTTCG CTGGATGCCA TCGCTGACCA GGCAAAACCC
GTGACCGTTG TTGTGCGTGT GCCGCAGGGG GAAACGGAAG AAGAAACCAC GACCAATATC
ATCGGCGCAG TGACTGCTGA AGGTAAAAAA ACAGGCATGA AAGCTCTGTT ATCTGCCCAG
TCACAGCTCG GCGTTAAACC GCGCATTCTC GGCGTGCCAG GCCACGATAA CAAAGCCGTT
GCGACTGAGT TGCTGAGCGT GGCGCAAAGC CTGCGTGGGT TTGCTTACCT GTCAGCGTAT
GGCTGCAAGA CAGTGCAGGA GGCGATCACT TACCGCGAAA ACTTCAGCCA GCGCGAAGGG
ATGCTGATCT GGCCCGACTT TACTGGCTGG GACACGGTGC TGAATGCCGA AGCAACGGCT
TATGCCACCG CTCGTGCGCT TGGTCTGCGC GCCAAAATTG ACGAGCAGAC CGGATGGCAC
AAAAGCCTGT CCAACGTGGG CGTGAACGGT GTCACCGGAA TTTCTGCTGA TGTGTTCTGG
GATCTGCAGG ACCCGGCAAC CGATGCAGGT CTGCTGAACC AGAACGACGT CACCACGCTT
GTGCGTAAAG ACGGTTTCCG CTTCTGGGGT TCCCGCAGCC TGAGTGATGA CCCGCTCTTT
GCCTTCGAAA ACTACACCCG CACGGCGCAG GTGCTGATGG ACACGATGGC AGAAGCACAC
ATGTGGGCGG TGGATAAACC GCTTAACCCG TCGCTGGCCC GCGACATTAT CGAGGGCATC
CGCGCCAAAA TGCGCAGCCT GGTCAGTCAG GGGTATCTCA TTGGTGGTGA TTGCTGGCTG
GACGAGTCGG TGAACGACAA AGACACTCTG AAAGCCGGAA AACTCACCAT CGACTACGAC
TACACGCCAG TGCCGCCACT TGAAAACCTG ATGTTGCGTC AGCGCATCAC CGATCAGTAC
CTGGTGAATT TCTCCAGCCA GGTCAGCGCG TAA
 
Protein sequence
MAQDYHHGVR VVEVNEGTRS ITTVSTAIVG MVCTGDDADA KMFPLNKPVL ITDVLTASGK 
AGESGTLARS LDAIADQAKP VTVVVRVPQG ETEEETTTNI IGAVTAEGKK TGMKALLSAQ
SQLGVKPRIL GVPGHDNKAV ATELLSVAQS LRGFAYLSAY GCKTVQEAIT YRENFSQREG
MLIWPDFTGW DTVLNAEATA YATARALGLR AKIDEQTGWH KSLSNVGVNG VTGISADVFW
DLQDPATDAG LLNQNDVTTL VRKDGFRFWG SRSLSDDPLF AFENYTRTAQ VLMDTMAEAH
MWAVDKPLNP SLARDIIEGI RAKMRSLVSQ GYLIGGDCWL DESVNDKDTL KAGKLTIDYD
YTPVPPLENL MLRQRITDQY LVNFSSQVSA