Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_2598 |
Symbol | |
ID | 5113766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 2801094 |
End bp | 2802323 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640492788 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001177317 |
Protein GI | 146312243 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00517058 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAATTGC ACGAACTGAA GCAAAAACGT AACACCATCG CCACTGATAT GCGCGCACTG AACGATAAAA TTGGCGATAC GACATGGACT GAAGAGCAGC GCACTCAGTG GAATGCTGCA AAGTCGGAGC TGGACGCGCT TGATGTGCGC ATTGCCCGTG AAGACGAGCT GCGCCGTCAG GATCAGGACT ACGTTGACGA AAATGAAAAG GAAAATCGTC ATCAGCAGAA TAACGACCCT GCGAACCCCG ATGCGAAAGC AGGTGAACGT CGCGCTGCCG CATTTGATCG CTTCCTTCGC CACGGCTTCA GCGAGCTGAG TGCGGAAGAG CGCCAGGCTG TAAAAGAGTT GCGTGCTCAG GGAACGTCAC CTGATGCGAA GGGTGGCTAC ACCGTACCAA CACAAATGCT GAATAAAATT GTCGACTCCA TGAAAGCCTA TGGTGGTATC GCCAGCGTTG CTCAAATCCT CAATACCTCA ACCGGCCAGG ATATCACCTG GTCAACGTCA GATGGCACGG CGGAAGAGGG CGAACTGTTG GGTGAAAACA CAGAAACCAC CGATGAAGAT GTGACGTTCG GCACCGCCGT CCTGGGTGCT AAAAAGCTGT CATCCAAAAT CATCAAAGTT TCTAACGAGC TGCTACAGGA CAGCGGCGTG GATATTGAAG CATATCTTGC ATCGCGTATC GGTCAGCGTA TTGGTCGGGG TGAAGCTAAA TATCTGGTGC AGGGCACCGG GGCAGGCGCA CCGGTACAGC CAAAAGGCCT GGTTGCTTCC GTAACCGGAA CGGTAAACAC GGCTGCTGCA GCTGCGTTCA CCTGGCAGGA AATGAATAAG CTGAAACATG CGATCGATCC GGCGTATCGC AGTGGGCCAA AATTCCGCTG GGCGTTCAAC GATTCAACTC TTCAGGTTAT TGAAGAGATG GTCGATGACC AGAAGCGTCC GCTCTGGCTC CCTGACGTAG TCGGCGGTAC TCCAGCCACG ATCCTTAACA TTCCTTACGT CATTGATCAG GCGATTGATG GTATTGCGGC GGGTAAAAAA TTCGCCTTCC TGGGTGATTT TGATCGCTTC ATTGTCCGTC GCGTTGCCTA CATGACACTG AAACGTCTGG TAGAGCGTTA CGCGGAATAT GATCAGACCG CGTTCCTGGC ATTCCATCGC TTCGATTGCG TTCTGGAAGA TACTGCAGCC ATCAAAGCGC TGGTGGGCAA GCCAGCATAA
|
Protein sequence | MKLHELKQKR NTIATDMRAL NDKIGDTTWT EEQRTQWNAA KSELDALDVR IAREDELRRQ DQDYVDENEK ENRHQQNNDP ANPDAKAGER RAAAFDRFLR HGFSELSAEE RQAVKELRAQ GTSPDAKGGY TVPTQMLNKI VDSMKAYGGI ASVAQILNTS TGQDITWSTS DGTAEEGELL GENTETTDED VTFGTAVLGA KKLSSKIIKV SNELLQDSGV DIEAYLASRI GQRIGRGEAK YLVQGTGAGA PVQPKGLVAS VTGTVNTAAA AAFTWQEMNK LKHAIDPAYR SGPKFRWAFN DSTLQVIEEM VDDQKRPLWL PDVVGGTPAT ILNIPYVIDQ AIDGIAAGKK FAFLGDFDRF IVRRVAYMTL KRLVERYAEY DQTAFLAFHR FDCVLEDTAA IKALVGKPA
|
| |