Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_2179 |
Symbol | |
ID | 7093400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 2351061 |
End bp | 2352182 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643465503 |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_002362479 |
Protein GI | 217978332 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGCGC CCGGCGCTCA ATTCACACAA AAGACGGCCG ATTCGGCCGA CGCGATCTTC GCGGCGATCG ATCGCCTCGA CGGCGACGCT GAGGCCAAGC GCGCCCATGA CGAGATACGC GCGGCGCGAT CGGCCTTCGT GAAGGCGCTT CGCCTCGGCC ACAAGGCGCT CTCGGCGCAG GAGGCGCGGC ATTTGGGCGA CGCGCAACAA GCGCGCGTTC TCAACATCGC GGAAGCCACG CCCCTCGCCG GCGGCTATCT CCTGCCGGCG CCCGTCGCCG CGAGTCTTCT GCGGCGCGTC GAACTCTACA CCCCTGTTGC GTCGGTGGCG CGCGTCATCA CGACCGATAC CGGCGGTCCG CTTTCCTGGC CCTTGGTCGA CGAGGCTTCT ATGGGCGCAG GCATTGTCGC CGAGAACAGC ACGCTCAATG CGGTGGATAT GCCAGTCGGA ACGCTTGGGC TGAACGCCAG TAAATTCTCG TCGGGAATCA TTCCTGTTTC GCTGGAGGTC TTGCAGGACA GCGCCGTAAA CATCGAGGAT ATGCTGCTCG ATTTGTTGGC TGCGCGCCTT GCGCGAGGCA TGAACAGTTT CTTTACTAGC GGAACGGGCG TAAACCAGCC GCAGGGCGCG CTGACCGGCG CCAGCCTTGG CGTAACGCTC CCGGCGGCCA ACACAACGTC GCTGACCTCG GACGGCTTGA TCGCGCTCTA TTCGAGCGTC GACGCCGCCT ATCGCCAGAG CTTGCGTTGC GTTTGGATGA TGAACGATAT GACATTGCTG GCGGTGCAGA AGATCGCCGC GGCGCAAGGC TGGCCGCTAT GGTTTCCCGA TCCGCTTCTC CAACCCGGCG GACCGCCGCC GCAGGGGCGA CTGTTCGGCC GTCCCGTCGT CATCAACAAT GATATGCCGG CGATGGCGGC GAGCGCCAAG CCGATTCTGT TCGGCGATTT TGCCTCTTTC GTGGCGCGCT TCGTCAACGG CGCCGCGATC CTACGGATGA ATGATTCCGG CTATCTTTCG AAAGGCCAGA CGGCCTTCGT CGCGTTCGCA CGAGCGGACA GCCGGGTCGC CAACTGGAGC GCGGGAGCCG CGCTGAGATA TCTCCAGAAC AGCGCAAGCT GA
|
Protein sequence | MLAPGAQFTQ KTADSADAIF AAIDRLDGDA EAKRAHDEIR AARSAFVKAL RLGHKALSAQ EARHLGDAQQ ARVLNIAEAT PLAGGYLLPA PVAASLLRRV ELYTPVASVA RVITTDTGGP LSWPLVDEAS MGAGIVAENS TLNAVDMPVG TLGLNASKFS SGIIPVSLEV LQDSAVNIED MLLDLLAARL ARGMNSFFTS GTGVNQPQGA LTGASLGVTL PAANTTSLTS DGLIALYSSV DAAYRQSLRC VWMMNDMTLL AVQKIAAAQG WPLWFPDPLL QPGGPPPQGR LFGRPVVINN DMPAMAASAK PILFGDFASF VARFVNGAAI LRMNDSGYLS KGQTAFVAFA RADSRVANWS AGAALRYLQN SAS
|
| |