Gene EcolC_2093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2093 
Symbol 
ID6067303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2289083 
End bp2290684 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content58% 
IMG OID641601501 
Productlambda family phage portal protein 
Protein accessionYP_001725060 
Protein GI170020106 
COG category[R] General function prediction only 
COG ID[COG5511] Bacteriophage capsid protein 
TIGRFAM ID[TIGR01539] phage portal protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0127798 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACGT CCACCATTCC CACCCTTCTG GGGCCGGACG GCATGACATC GCTGCGTGAA 
TATGCCGGTT ATCACGGCGG TGGCAGCGGA TTTGGTGGGC AGTTGCGGGC GTGGAACCCA
CCGGGTGAAA GTGTGGATGC AGCCCTGCTG CCCAACTTTA CCCGTGGCAA TGCCCGCGCA
GACGATCTGG TACGCAATAA CGGCTATGCC GCCAACGCCA TCCAGTTGCA TCAGGATCAT
ATCGTCGGGT CTTTTTTCCG GCTCAGTCAT CGCCCAAGCT GGCGCTATCT GGGCATCGGG
GAGGAAGAAG CCCGTGCCTT TTCCCGCGAG GTTGAAGCGG CATGGAAAGA GTTTGCCGAG
GATGACTGCT GCTGCATTGA CGTTGAGCGA AAACGCACGT TTACCATGAT GATTCGGGAA
GGTGTGGCCA TGCACGCCTT TAACGGTGAA CTGTTCGTTC AGGCCACCTG GGATACCAGT
CCGTCGCGGC TTTTCCGGAC ACAGTTCCGG ATGGTCAGCC CGAAGCGCAT CAGCAACCCG
AACAATACCG GCGACAGCCG GAACTGCCGT GCCGGTGTGC AGATTAATGA CAGCGGTGCG
GCGCTGGGAT ATTACGTCAG CGAGGACGGG TATCCTGGCT GGATGCCGCA GAAATGGACA
TGGATACCCC GTGAGTTACC CGGCGGGCGC GCCTCGTTCA TTCACGTTTT TGAACCCGTG
GAGGACGGGC AGACCCGCGG TGCAAATGTG TTTTACAGCG TGATGGAGCA GATGAAGATG
CTTGACACGC TGCAGAACAC GCAGCTGCAG AGCGCCATTG TGAAGGCGAT GTATGCCGCC
ACCATTGAGA GTGAGCTGGA TACGCAGTCA GCGATGGATT TTATTCTGGG CGCGAACAGT
AAGGAGCAGC GGGACAAGCT GACCGGCTGG ATTGGTGAAA TTGCCGCGTA TTACGCCGCA
GCACCGGTCC GTCTGGGAGG CGCAAAAGTG CCGCACCTGA TGCCGGGGGA CTCACTGAAC
CTGCAGACGG CTCAGGACAC GGATAACGGC TACTCCGTGT TTGAGCAGTC ACTGTTGCGG
TATATCGCTG CCGGGCTGGG TGTCTCGTAT GAGCAGCTTT CCCGGAATTA CGCCCAGATG
AGCTACTCCA CGGCACGGGC CAGTGCGAAC GAGTCGTGGG CGTACTTTAT GGGGCGGCGA
AAATTCGTCG CATCCCGTCA GGCGAGCCAG ATGTTTCTGT GCTGGCTGGA AGAGGCCATC
GTTCGCCGCG TGGTGACGTT ACCTTCAAAA GCGCGCTTCA GTTTTCAGGA AGCCCGCAGT
GCCTGGGGGA ACTGCGACTG GATAGGCTCC GGTCGTATGG CCATCGATGG TCTGAAAGAA
GTTCAGGAAG CGGTGATGCT GATAGAAGCC GGACTGAGCA CCTACGAGAA AGAGTGCGCG
AAACGCGGTG ACGACTATCA GGAAATTTTT GCCCAGCAGG TCCGTGAAAC GATGGAGCGC
CGCGCAGCCG GTCTTAAACC GCCCGCCTGG GCGGCTGCAG CATTTGAATC CGGGCTGCGA
CAATCAACAG AGGAGGAGAA GAGTGACAGC AGAGCTGCGT AA
 
Protein sequence
MKTSTIPTLL GPDGMTSLRE YAGYHGGGSG FGGQLRAWNP PGESVDAALL PNFTRGNARA 
DDLVRNNGYA ANAIQLHQDH IVGSFFRLSH RPSWRYLGIG EEEARAFSRE VEAAWKEFAE
DDCCCIDVER KRTFTMMIRE GVAMHAFNGE LFVQATWDTS PSRLFRTQFR MVSPKRISNP
NNTGDSRNCR AGVQINDSGA ALGYYVSEDG YPGWMPQKWT WIPRELPGGR ASFIHVFEPV
EDGQTRGANV FYSVMEQMKM LDTLQNTQLQ SAIVKAMYAA TIESELDTQS AMDFILGANS
KEQRDKLTGW IGEIAAYYAA APVRLGGAKV PHLMPGDSLN LQTAQDTDNG YSVFEQSLLR
YIAAGLGVSY EQLSRNYAQM SYSTARASAN ESWAYFMGRR KFVASRQASQ MFLCWLEEAI
VRRVVTLPSK ARFSFQEARS AWGNCDWIGS GRMAIDGLKE VQEAVMLIEA GLSTYEKECA
KRGDDYQEIF AQQVRETMER RAAGLKPPAW AAAAFESGLR QSTEEEKSDS RAA