Gene Mnod_1996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_1996 
Symbol 
ID7305185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp2097492 
End bp2098913 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content69% 
IMG OID643599731 
Productphage portal protein, HK97 family 
Protein accessionYP_002497286 
Protein GI220921985 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0457981 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGCTGC TCGCCTCGCT CTTCGGCGCT GCGGCCGCGC CCCGCTTCGG CCCGTCAGCC 
TCCGTCTCGG ACGGCGGCTG GCTCATCCGC GCCATCGGTG GCGGGCGCAC CGCCGCTGGC
ACGGTCGTGA CCGAGCACTC GGCCCTGCGG CTGCCGGTGG TCTACGCCTG CGTCAACCGC
ATCTCGAACC CGCTGGCCCG CTTCCCGATC AAGATCATGA AGCCCCGCGC CGGCGGGGGC
AGCGAGGAGG TGACGGACCA TCCCCTGTCG CGCCGGCTCG GCCTGCGGCC CAACGACTTC
ATGTCGTCGC GCACCCTGCG CAAGACGGCG CAGGCTCATG CTCTCCTGTG GGGCAACGGC
TACATGGAGA TCGAGCGCAA CGGCCGCGGG CAGGCGGTCG GTCTCTGGCC GCTCCTGCCC
TGGGCCACGC AGCCGGTGCG CGAGGACGGC GTGCTGGTCT ACCGGACCAC CATCGACGGG
CAGACCTTCC GCCTCGACCA CGAGGACGTC CTGCACATCA TGGACCTCAG CCAGGACGGC
TATGTGGGGC ATTCGCCGGT GGCACTGGCC CGCGAGGCCT TGGGGCTCGC GCAGGCCCTT
GAGCAGTTCG GCGGCAAGTT CTTCGCCAAC GATGCCAAGA GCGGCGGCTT CCTCCTGCAT
CCCGGCCGGC TCTCGGCCGG CGCACAGGCG AACCTGAGGG CGCAGGGACC GCGCGGGCAG
CGCGACCCGA ACGCTCCGCG GGTCGAGCCG GGGCGCACCG ACCCCGGCGC GATGCTGGAG
CGCCAGGGCG GCCTCGACAA CGCGCACCGG GTCAAGGTGC TCGAGGAGGG CATGAAGTAC
ATCCAGACGA CGATCCCGCC CGAGGATGCG CAGTTCCTCG GCACCCGCGA GATGCAGATC
GCGGAAATCG CGCGGATGTA CGATGTGCCG CTGATCCTGC TGCAGAGCCA CGAGAAGACG
ACGTCGTGGG GCTCCGGCAT CGAGCAGCTG ATGATCGGCT TCGTCCGTCA GACCGTCGAG
CCCTGGGTGA ATGCCTGGGA GCAGGAGATG AACTGGAAGC TCTTCACGGA AGAGGAGCGA
AAGCAGGGAT ACTTCGTCAA GTTCAACATG AACGCGCTCC TGCGCGGCGA CATGATGAGC
CGGGCCCGGT TCTACCAGCT TCTGTTCGGC GTGGGCGGCC TCTCGCCCAA TGATATCCTG
ACGCTGGAGG ACATGGACCC GCTCGGCCCC GAAGGCGATC ACCACTTCGT GCCGGTCAAC
ATGCACACCC TCAAGAACGC GATCGACACC GTCGGCGTGC CCCAGGGCGG TGCCGTGCCT
CCCGATCCGA CCCAGGAGGC GCGGCTGGCC GCCGTGGAGG GGCGCGTGGA CGAGCTCGAC
GTCATCGCTG CCCGTCTCGA CGCTTTGGAG CGCGCCGCAT GA
 
Protein sequence
MGLLASLFGA AAAPRFGPSA SVSDGGWLIR AIGGGRTAAG TVVTEHSALR LPVVYACVNR 
ISNPLARFPI KIMKPRAGGG SEEVTDHPLS RRLGLRPNDF MSSRTLRKTA QAHALLWGNG
YMEIERNGRG QAVGLWPLLP WATQPVREDG VLVYRTTIDG QTFRLDHEDV LHIMDLSQDG
YVGHSPVALA REALGLAQAL EQFGGKFFAN DAKSGGFLLH PGRLSAGAQA NLRAQGPRGQ
RDPNAPRVEP GRTDPGAMLE RQGGLDNAHR VKVLEEGMKY IQTTIPPEDA QFLGTREMQI
AEIARMYDVP LILLQSHEKT TSWGSGIEQL MIGFVRQTVE PWVNAWEQEM NWKLFTEEER
KQGYFVKFNM NALLRGDMMS RARFYQLLFG VGGLSPNDIL TLEDMDPLGP EGDHHFVPVN
MHTLKNAIDT VGVPQGGAVP PDPTQEARLA AVEGRVDELD VIAARLDALE RAA