Gene Mnod_4078 details


Gene Information

Locus tag: Mnod_4078
Symbol:
ID: 7303455
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 4144895
End bp: 4146268
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 69%
IMG OID: 643601730
Product: phage portal protein, HK97 family
Protein accession: YP_002499260
Protein GI: 220923958
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
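
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4144895
end_bp = 4146268

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a
# stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1374 0 457
```

Under these assumptions the result agrees with the listed gene length of 1374 bp and protein length of 457 aa.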


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.333282
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCGCATCT TCGGGCTGAC GATCACCCGC GAGAAGGCGG CCCCGACTGC CTCTCCGGTC 
GACACCCGCG GCGGCTGGTG GGGCATTGTC CGGGAGGCCT TCACGGGGGC GTGGCAGAAG
AGTGTCGAGG TGAGGCTCGA CACGGTGCTG ACCTACAGCG CCGTGTTCCG CTGCGTGTCC
CTCATCGCCT CGGATATCGC CAAGATGCGC CTGCGGCTCG TGTCGCAGGA TGCGGACGGG
ATCTGGACCG AGACCAGCAG CCCGTCCTTC TCGCCGGTCC TGCGCAAGCC GAACCGCTTC
CAGAACCGGA TCCAGTTCAT CACGAGCTGG GTCGAGTCGA AGCTGATCCA CGGCAACACC
TACGTGCTCA AGGAGCGCGA CAGCCGTCGC GTGGTGGTCG CCCTCTCCGT GCTCGACCCC
ACCCGCGTGA AGCCGCTGGT CGCCCCCGAC GGCGAGGTCT TCTACCAGCT CTCCCGCGAC
GATCTGGCCG GGGTCAGCGA CCTGGATGCC GCCCTGCTCG TGCCGGCCAG CGAGATCATC
CACGACCGCT GGAACACGCT CCATCACCCG CTGGTCGGCA CCTCCCCCAT CTACGCCTGC
GGTCTCGCCG CGGTGCAGGG GATCCGGATC CAGACCAACA GCGCGCACTT CTTCGGCAAC
GGCTCGCAGC CGAGCGGGAT CCTGGTGGCG CCCGGCCCGG TCTCGGAGGA GAACGCCAAG
CGCCTGAAGG CGCATTGGGA GCAGAACTTC ACGGGCCCGA ACGTCGGCCG GGTGGCGGTG
CTGGGCGACG GCCTGCGCTA CGAGCCCATG GCCGTGAAGG CCAGCGATGC CCAGCTGATC
GAGCAGCTGA AGTGGAGCGC CGAGACGGTC TGCTCGGTGT TCGGGGTGCC GGCCTACAAG
ATCGGGGTCG GCGCGCCGCC CGCCTACACC AACATCGAGG CCTTGGACGC GCAATACTAT
GCGCAGTGCC TGCAGATCCA CATCGAGAGC ATCGAGCTGT GCCTCGATGA GGGGCTTGCT
CTGCCGGCGC CGTATGGGAC CGAGTTCGAG CTCGACACCC TCCTGCGCAT GGATACCGCG
ACCCAGATCC GGACCTACGC CGAGGGCGTG AAGGGCGGCC TGCTGAAGCC GGACGAGGGC
CGGGCGAAGC TCGGGCTGCC GCCGGTGACC GGCGGCAACG CGGTCTACCT GCAGCAGCAG
AATTTCAGCC TCGCGGCGCT GGCCAAGCGC GACGCCCAGG CCGACCCGTT CAATCCCTCC
GCCCCCGCAT CTCCGCCCCC AGAGCCCGCG CCGCCGCCAG ACGCGGCAGA GGAGGTCAGT
CGCTTCGCCT CAGCGCTGCG GCTCAAGCTG GCAGAGGCGA TTGTGAATGC GTGA
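
The GC content field of the record (69%) can be recomputed from the gene sequence text. The following is a minimal sketch, assuming the value is the percentage of G and C bases over the whole gene and that the spaces and line breaks in the blocked layout should be ignored. The function name `gc_content` is hypothetical, and only the first sequence line is used here as sample input.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring all whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Sample input: the first line of the gene sequence above.
first_line = "GTGCGCATCT TCGGGCTGAC GATCACCCGC GAGAAGGCGG CCCCGACTGC CTCTCCGGTC"
print(round(gc_content(first_line), 1))
```

Passing the full 1374 bp sequence instead of `first_line` gives the gene-wide value to compare with the record.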
 
Protein sequence
MRIFGLTITR EKAAPTASPV DTRGGWWGIV REAFTGAWQK SVEVRLDTVL TYSAVFRCVS 
LIASDIAKMR LRLVSQDADG IWTETSSPSF SPVLRKPNRF QNRIQFITSW VESKLIHGNT
YVLKERDSRR VVVALSVLDP TRVKPLVAPD GEVFYQLSRD DLAGVSDLDA ALLVPASEII
HDRWNTLHHP LVGTSPIYAC GLAAVQGIRI QTNSAHFFGN GSQPSGILVA PGPVSEENAK
RLKAHWEQNF TGPNVGRVAV LGDGLRYEPM AVKASDAQLI EQLKWSAETV CSVFGVPAYK
IGVGAPPAYT NIEALDAQYY AQCLQIHIES IELCLDEGLA LPAPYGTEFE LDTLLRMDTA
TQIRTYAEGV KGGLLKPDEG RAKLGLPPVT GGNAVYLQQQ NFSLAALAKR DAQADPFNPS
APASPPPEPA PPPDAAEEVS RFASALRLKL AEAIVNA
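
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions, not the database's own pipeline: it uses the standard amino acid assignments (which translation table 11 shares), treats the first codon as a valid initiator and reads it as methionine (the gene here starts with GTG), and stops at the first stop codon. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, ignoring whitespace, up to the first stop codon."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    # Assumption: the first codon is an initiator and is read as Met,
    # even when it is an alternative start codon such as GTG.
    protein = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Sample input: the first line of the gene sequence above.
first_line = "GTGCGCATCT TCGGGCTGAC GATCACCCGC GAGAAGGCGG CCCCGACTGC CTCTCCGGTC"
print(translate_cds(first_line))  # prints: MRIFGLTITREKAAPTASPV
```

The output matches the first 20 residues of the protein sequence above. The sketch does not check that the first codon is actually an allowed start codon, which a stricter implementation would do.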