Gene Msil_2179 details

Gene Information

Locus tag: Msil_2179
Symbol:
ID: 7093400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2351061
End bp: 2352182
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 64%
IMG OID: 643465503
Product: phage major capsid protein, HK97 family
Protein accession: YP_002362479
Protein GI: 217978332
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
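
The Start bp, End bp, Gene Length and Protein Length fields above can be checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2351061
END_BP = 2352182


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1122 373
```

Both results agree with the Gene Length (1122 bp) and Protein Length (373 aa) fields of the record.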


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGCGC CCGGCGCTCA ATTCACACAA AAGACGGCCG ATTCGGCCGA CGCGATCTTC 
GCGGCGATCG ATCGCCTCGA CGGCGACGCT GAGGCCAAGC GCGCCCATGA CGAGATACGC
GCGGCGCGAT CGGCCTTCGT GAAGGCGCTT CGCCTCGGCC ACAAGGCGCT CTCGGCGCAG
GAGGCGCGGC ATTTGGGCGA CGCGCAACAA GCGCGCGTTC TCAACATCGC GGAAGCCACG
CCCCTCGCCG GCGGCTATCT CCTGCCGGCG CCCGTCGCCG CGAGTCTTCT GCGGCGCGTC
GAACTCTACA CCCCTGTTGC GTCGGTGGCG CGCGTCATCA CGACCGATAC CGGCGGTCCG
CTTTCCTGGC CCTTGGTCGA CGAGGCTTCT ATGGGCGCAG GCATTGTCGC CGAGAACAGC
ACGCTCAATG CGGTGGATAT GCCAGTCGGA ACGCTTGGGC TGAACGCCAG TAAATTCTCG
TCGGGAATCA TTCCTGTTTC GCTGGAGGTC TTGCAGGACA GCGCCGTAAA CATCGAGGAT
ATGCTGCTCG ATTTGTTGGC TGCGCGCCTT GCGCGAGGCA TGAACAGTTT CTTTACTAGC
GGAACGGGCG TAAACCAGCC GCAGGGCGCG CTGACCGGCG CCAGCCTTGG CGTAACGCTC
CCGGCGGCCA ACACAACGTC GCTGACCTCG GACGGCTTGA TCGCGCTCTA TTCGAGCGTC
GACGCCGCCT ATCGCCAGAG CTTGCGTTGC GTTTGGATGA TGAACGATAT GACATTGCTG
GCGGTGCAGA AGATCGCCGC GGCGCAAGGC TGGCCGCTAT GGTTTCCCGA TCCGCTTCTC
CAACCCGGCG GACCGCCGCC GCAGGGGCGA CTGTTCGGCC GTCCCGTCGT CATCAACAAT
GATATGCCGG CGATGGCGGC GAGCGCCAAG CCGATTCTGT TCGGCGATTT TGCCTCTTTC
GTGGCGCGCT TCGTCAACGG CGCCGCGATC CTACGGATGA ATGATTCCGG CTATCTTTCG
AAAGGCCAGA CGGCCTTCGT CGCGTTCGCA CGAGCGGACA GCCGGGTCGC CAACTGGAGC
GCGGGAGCCG CGCTGAGATA TCTCCAGAAC AGCGCAAGCT GA
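
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC-fraction calculation of the kind behind the GC content field; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as sample input.
first_line = "ATGCTTGCGC CCGGCGCTCA ATTCACACAA AAGACGGCCG ATTCGGCCGA CGCGATCTTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of first line: {gc_fraction(seq):.2f}")
```

Passing the full sequence text to `clean_sequence` and then to `gc_fraction` would give the value to compare against the GC content field of the record.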
 
Protein sequence
MLAPGAQFTQ KTADSADAIF AAIDRLDGDA EAKRAHDEIR AARSAFVKAL RLGHKALSAQ 
EARHLGDAQQ ARVLNIAEAT PLAGGYLLPA PVAASLLRRV ELYTPVASVA RVITTDTGGP
LSWPLVDEAS MGAGIVAENS TLNAVDMPVG TLGLNASKFS SGIIPVSLEV LQDSAVNIED
MLLDLLAARL ARGMNSFFTS GTGVNQPQGA LTGASLGVTL PAANTTSLTS DGLIALYSSV
DAAYRQSLRC VWMMNDMTLL AVQKIAAAQG WPLWFPDPLL QPGGPPPQGR LFGRPVVINN
DMPAMAASAK PILFGDFASF VARFVNGAAI LRMNDSGYLS KGQTAFVAFA RADSRVANWS
AGAALRYLQN SAS