Gene Mnod_5065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_5065 
Symbol 
ID7303758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp5140970 
End bp5142337 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content68% 
IMG OID643602695 
Productphage portal protein, HK97 family 
Protein accessionYP_002500214 
Protein GI220924912 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCTTCT TCGGGCTGAC GATCACCCGC GAGAAGGCGG CCCCGACTGC CTCTCCGGTC 
GACACCCGCG GCGGCTGGTG GGGCATTGTC CGGGAGGCCT TCACGGGCGC GTGGCAGAAG
AGTGTCGAGG TGAGGCTCGA CACGGTGCTG ACCTATAGCG CCGTGTTCCG CTGCGTCTCC
CTAATCTCGT CTGACATCGC CAAGATGCGC CTGCGGCTCG TGCAGCAGGA TGCGGACGGG
ATCTGGACCG AGACCAGCAG CCCGTCCTTC TCGCCGGTCC TGCGCAAGCC GAACCGGTTC
CAGAACCGGA TCCAGTTCAT CACGAGCTGG GTGGAGTCGA AGCTGGTCCA TGGCAACACC
TACGTGCTCA AGGAGCGCGA CAGCCGTCGC GTGGTGGTCG CCCTCTCCGT GCTCGACCCT
ACCCGCGTGA AGCCGCTGGT CGCCCCCGAC GGCGAGGTCT TCTACCAGCT CTCCCGCGAC
GATCTGGCCG GGGTCAGCGA CCTGGATGCC GCCGTGCTGG TGCCGGCCAG CGAGATCATC
CACGACCGCT GGAACACGCT CCATCACCCG CTGGTCGGCA CCTCGCCCAT TTATGCCTGT
GGTCTCGCCG CGGTGCAGGG CATCCGGATC CAGACCAACA GCGCGCACTT CTTCGGCAAC
GGCTCGCAGC CGAGCGGGAT CCTGGTGGCG CCCGGCCCGG TCTCGGAGGA GAACGCCAAG
CGCCTGAAGG CGCATTGGGA GCAGAACTTC ACGGGTCAGA ACGTCGGCCG GGTGGCGGTG
CTGGGTGATG GCCTGCGCTA CGAGCCCATG GCCGTGAAGG CCAGCGATGC CCAGCTGATC
GAGCAGCTGA AGTGGAGCGC CGAGACGGTC TGCTCGGTGT TCGGGGTGCC GGCCTACAAG
ATCGGGGTCG GCGCGCCGCC CGCCTACACC AACATCGAGG CCTTGGACGC GCAATACTAT
GCGCAGTGCC TGCAGATCCA CATCGAGAGC ATCGAGCTGT GCCTCGATGA GGGGCTTACT
CTGCCGGCAC CCTATGGGAC CGAGTTCGAG CTCGACGCCC TCCTGCGCAT GGATACCGCG
ACCCAGATCC GGACCTACGC CGAGGGCGTG AAGGGCGGCC TGATGAAGCC GGACGAGGGC
CGGGCGAAGC TCGGGCTGCC GCCGGTGACC GGCGGCAATG CCGTCTACCT GCAGCAGCAG
AACTACAGCC TCGCGGCGCT GGCCAAGCGC GACGCCCAGG CCGACCCGTT CAATCCCGCT
CCGGCAGCAT CCCCTCCGGA GCTTGCGCCG CCAGACGCGA CAGAGGAGGT CAGCCGCTTC
GCGACAGCGC TGCGGCTCAA GTTCGCCGAG GCGCCTGCGC ATGCGTGA
 
Protein sequence
MRFFGLTITR EKAAPTASPV DTRGGWWGIV REAFTGAWQK SVEVRLDTVL TYSAVFRCVS 
LISSDIAKMR LRLVQQDADG IWTETSSPSF SPVLRKPNRF QNRIQFITSW VESKLVHGNT
YVLKERDSRR VVVALSVLDP TRVKPLVAPD GEVFYQLSRD DLAGVSDLDA AVLVPASEII
HDRWNTLHHP LVGTSPIYAC GLAAVQGIRI QTNSAHFFGN GSQPSGILVA PGPVSEENAK
RLKAHWEQNF TGQNVGRVAV LGDGLRYEPM AVKASDAQLI EQLKWSAETV CSVFGVPAYK
IGVGAPPAYT NIEALDAQYY AQCLQIHIES IELCLDEGLT LPAPYGTEFE LDALLRMDTA
TQIRTYAEGV KGGLMKPDEG RAKLGLPPVT GGNAVYLQQQ NYSLAALAKR DAQADPFNPA
PAASPPELAP PDATEEVSRF ATALRLKFAE APAHA