Gene Information |
Locus tag | EcolC_2093 |
Symbol | |
ID | 6067303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2289083 |
End bp | 2290684 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641601501 |
Product | lambda family phage portal protein |
Protein accession | YP_001725060 |
Protein GI | 170020106 |
COG category | [R] General function prediction only |
COG ID | [COG5511] Bacteriophage capsid protein |
TIGRFAM ID | [TIGR01539] phage portal protein, lambda family |
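The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2289083, 2290684)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1602 533
```

Under these assumptions the computed values agree with the listed Gene Length (1602 bp) and Protein Length (533 aa).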
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0127798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACGT CCACCATTCC CACCCTTCTG GGGCCGGACG GCATGACATC GCTGCGTGAA TATGCCGGTT ATCACGGCGG TGGCAGCGGA TTTGGTGGGC AGTTGCGGGC GTGGAACCCA CCGGGTGAAA GTGTGGATGC AGCCCTGCTG CCCAACTTTA CCCGTGGCAA TGCCCGCGCA GACGATCTGG TACGCAATAA CGGCTATGCC GCCAACGCCA TCCAGTTGCA TCAGGATCAT ATCGTCGGGT CTTTTTTCCG GCTCAGTCAT CGCCCAAGCT GGCGCTATCT GGGCATCGGG GAGGAAGAAG CCCGTGCCTT TTCCCGCGAG GTTGAAGCGG CATGGAAAGA GTTTGCCGAG GATGACTGCT GCTGCATTGA CGTTGAGCGA AAACGCACGT TTACCATGAT GATTCGGGAA GGTGTGGCCA TGCACGCCTT TAACGGTGAA CTGTTCGTTC AGGCCACCTG GGATACCAGT CCGTCGCGGC TTTTCCGGAC ACAGTTCCGG ATGGTCAGCC CGAAGCGCAT CAGCAACCCG AACAATACCG GCGACAGCCG GAACTGCCGT GCCGGTGTGC AGATTAATGA CAGCGGTGCG GCGCTGGGAT ATTACGTCAG CGAGGACGGG TATCCTGGCT GGATGCCGCA GAAATGGACA TGGATACCCC GTGAGTTACC CGGCGGGCGC GCCTCGTTCA TTCACGTTTT TGAACCCGTG GAGGACGGGC AGACCCGCGG TGCAAATGTG TTTTACAGCG TGATGGAGCA GATGAAGATG CTTGACACGC TGCAGAACAC GCAGCTGCAG AGCGCCATTG TGAAGGCGAT GTATGCCGCC ACCATTGAGA GTGAGCTGGA TACGCAGTCA GCGATGGATT TTATTCTGGG CGCGAACAGT AAGGAGCAGC GGGACAAGCT GACCGGCTGG ATTGGTGAAA TTGCCGCGTA TTACGCCGCA GCACCGGTCC GTCTGGGAGG CGCAAAAGTG CCGCACCTGA TGCCGGGGGA CTCACTGAAC CTGCAGACGG CTCAGGACAC GGATAACGGC TACTCCGTGT TTGAGCAGTC ACTGTTGCGG TATATCGCTG CCGGGCTGGG TGTCTCGTAT GAGCAGCTTT CCCGGAATTA CGCCCAGATG AGCTACTCCA CGGCACGGGC CAGTGCGAAC GAGTCGTGGG CGTACTTTAT GGGGCGGCGA AAATTCGTCG CATCCCGTCA GGCGAGCCAG ATGTTTCTGT GCTGGCTGGA AGAGGCCATC GTTCGCCGCG TGGTGACGTT ACCTTCAAAA GCGCGCTTCA GTTTTCAGGA AGCCCGCAGT GCCTGGGGGA ACTGCGACTG GATAGGCTCC GGTCGTATGG CCATCGATGG TCTGAAAGAA GTTCAGGAAG CGGTGATGCT GATAGAAGCC GGACTGAGCA CCTACGAGAA AGAGTGCGCG AAACGCGGTG ACGACTATCA GGAAATTTTT GCCCAGCAGG TCCGTGAAAC GATGGAGCGC CGCGCAGCCG GTCTTAAACC GCCCGCCTGG GCGGCTGCAG CATTTGAATC CGGGCTGCGA CAATCAACAG AGGAGGAGAA GAGTGACAGC AGAGCTGCGT AA
|
Protein sequence | MKTSTIPTLL GPDGMTSLRE YAGYHGGGSG FGGQLRAWNP PGESVDAALL PNFTRGNARA DDLVRNNGYA ANAIQLHQDH IVGSFFRLSH RPSWRYLGIG EEEARAFSRE VEAAWKEFAE DDCCCIDVER KRTFTMMIRE GVAMHAFNGE LFVQATWDTS PSRLFRTQFR MVSPKRISNP NNTGDSRNCR AGVQINDSGA ALGYYVSEDG YPGWMPQKWT WIPRELPGGR ASFIHVFEPV EDGQTRGANV FYSVMEQMKM LDTLQNTQLQ SAIVKAMYAA TIESELDTQS AMDFILGANS KEQRDKLTGW IGEIAAYYAA APVRLGGAKV PHLMPGDSLN LQTAQDTDNG YSVFEQSLLR YIAAGLGVSY EQLSRNYAQM SYSTARASAN ESWAYFMGRR KFVASRQASQ MFLCWLEEAI VRRVVTLPSK ARFSFQEARS AWGNCDWIGS GRMAIDGLKE VQEAVMLIEA GLSTYEKECA KRGDDYQEIF AQQVRETMER RAAGLKPPAW AAAAFESGLR QSTEEEKSDS RAA
|
| |
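The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. This is a minimal sketch under stated assumptions, not the method used to produce the record. It builds the codon-to-amino-acid mapping shared by the standard code and translation table 11, strips the spaces that separate the 10-base groups, and stops at the first stop codon. Alternative start codons allowed by table 11 are not handled, which is acceptable here only because this gene begins with ATG. The names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order (shared by the standard code and table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. gaps between 10-base groups) and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGAAAACGT CCACCATTCC CACCCTTCTG"))  # MKTSTIPTLL
```

The ten residues produced from the first 30 bases match the first group of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.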