Gene Mnod_3858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_3858 
Symbol 
ID7302722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp3946190 
End bp3948268 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content72% 
IMG OID643601530 
Productterminase GpA 
Protein accessionYP_002499061 
Protein GI220923759 
COG category[R] General function prediction only 
COG ID[COG5525] Bacteriophage tail assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCACCA GCACCTCGCC GAACTCGGCG AGCCCACCGC GACCTTCGGC TGACACCACG 
CGCCTGCAAC GGGCCTGGCG CCGGGGGCTC ACCCCGCCCC CGAACCTCAA CGTGGTGGAA
TGGGCGGAGC GCTACCGCAG ACTGAGCAAG GAATCCTCGA ACGGCGGCCG CTTCATCGTC
TCACGCGTCG AGGTCGCCCG CGGCCCGATG TTGGCGGCGA CCGAGCCGGG AGTGCGAACC
ATCACCCTGC TGGCCTGCAC GCAGCTGCTC AAGACCACGG TCATCGAGAA CATCCTCGGC
CGCTTCGTCC ACGTGGATCC CTGCCCGATG CTGGCGGTGC TGCCCAAGGA CGACGCGGCC
GAGACCTTCT CCAAGGACCG CCTCGCGCCG ATGATCCGGG ACACCCCGGT GCTGCGCGAG
GTGTTCGGGG AGGCCAAGGC CCGCGATGCC GGGGCGACGC TGACCCACAA GCAGTTCCCG
GGCGGCCACA TCACCTTGGT CGGGGCGAAC AGCCCGACCA ACCTCGCGAT GCGGCCGATC
CGGCTGCTGG TCTGCGACGA GATCGACAAG TACCCGCTCT CCGCTGGTGG GGAGGGGCCG
CCCATCGACC TTGCCGAGGA GCGCCAGGCC GAGTTCAAGG CCACCAGCCT GTCGGTGCGG
GCCTGCTCCC CTACGGTCGC CGGGCGTTCG GCGATCGAGG CGTCTTACGA GGAGAGCGAT
CAGCGCAAGG CCTTTGTCGC CTGCCCGCAC TGCGGCGGCT GGCACCCGCT CGAATGGGAG
CAGGTGCGCT TCGACAAGGA CGAGGACGGC CGCATCCGCC CCGAGACGGC CCGCTACGAA
TGCGTGGAGT GCGAGCGGCC CTGGACCGAG GCGCAGCGCC TCATTGCCCT GCGGCGGGTC
GAGTGGCGGC AGACCCGGGC CTTCACCTGC TGCGGCGAGC GGCAGGTCCC GGAGATCTGG
GAGGAGGAGC GGCACGGGGT CCGGCGCGCG CTCTGCCGGC ATTGCGGCGC GCGGGCGCTG
CCGAACGAGC ATGCCGGGTT CCAGGTCTCG AAGCTCTACG CCCCCAAGCA GACGGTGCGC
GAGACGGTGG CGAAGTTCGC GCGGGCGCTG CGCCGCGGCC CGGAGGCGTT GCGCACCTTC
TTCAACACCC AGCTCGCCCG CACCTGGAAG GAGGGGGCGG ACGCGCCCGA GTGGCAGGAC
GTCTATGCCC GCCGGGACGA GTACCTGTCC GGCACGGTCG CGCGGGCGGC ACTGATCCTG
TTTGCGGGCG TGGACGTCCA GAAGGATCGC CTGGAGGTCG GGATCTGGGC GTTCGGGCGC
AACCGCGAGC GCTGGCTGGT GGAGCACCGG GTGCTGCCCG GGGCGACCAA CCGGCCGGAG
GTGTGGGCCG ACCTCGAAGC CATGGTCGAG GAGACCTGGC CGCACGCGAG CGGCGCCGAG
ATGACGGTGC GGGATTGGGG CATCGACTCC AGCGGCTTCA CCGCGGAGGT GTATGCCTTC
GTGCGCCGGC AGGCCGGGCG CCCGGTGCAT GCGGTGGATG GCCAGGACAG CTATGCCGGC
GCCTTCCTGG GCGTGGGCGC GAAGGATTCC ACGGCCGCCG GCCGGAAGCT GCGGCGCGGG
CTGAAGACGG TCCGGATCGG GGCGTCCTTC GCCAAGCAGG AGCTGATGGG CTGCCTTGCC
CTGCACCGCC CGCCGGAGGG CAAGCCGTTC CCGGCCGGGT TCGTGCACCT GCCGCGGGAC
GTCAGCGAGG ACCAGGTCAA GCAGCTCACG GCCGAGGAGC TGGTCACCCA CGTCACCCGG
GGCCGCACGC GCCGGGAGTG GGTGCCGATC GGCGGGCGGC GCAACGAGGT GCTGGACTGC
GCCAACTATG CCCGCGGGCT TGCCGCGATG CGGGGCTGGG ACCGCTGGCG CGAGCCGCAT
TGGCGCGAGC TCGAGGCGGC GCTCGGCATC GAGCGGCCCC GGCCAGCGGC GGATGAGCCA
GTGGTGGCGC CCGAGGTCGC GGCGCGGACG CTCGCGGCCC GCAACCAGCA GCGGCGCGCC
GTGCGGCGCA GCCGGGTCAA CAACCGGCCG ATGAGGTAG
 
Protein sequence
MSTSTSPNSA SPPRPSADTT RLQRAWRRGL TPPPNLNVVE WAERYRRLSK ESSNGGRFIV 
SRVEVARGPM LAATEPGVRT ITLLACTQLL KTTVIENILG RFVHVDPCPM LAVLPKDDAA
ETFSKDRLAP MIRDTPVLRE VFGEAKARDA GATLTHKQFP GGHITLVGAN SPTNLAMRPI
RLLVCDEIDK YPLSAGGEGP PIDLAEERQA EFKATSLSVR ACSPTVAGRS AIEASYEESD
QRKAFVACPH CGGWHPLEWE QVRFDKDEDG RIRPETARYE CVECERPWTE AQRLIALRRV
EWRQTRAFTC CGERQVPEIW EEERHGVRRA LCRHCGARAL PNEHAGFQVS KLYAPKQTVR
ETVAKFARAL RRGPEALRTF FNTQLARTWK EGADAPEWQD VYARRDEYLS GTVARAALIL
FAGVDVQKDR LEVGIWAFGR NRERWLVEHR VLPGATNRPE VWADLEAMVE ETWPHASGAE
MTVRDWGIDS SGFTAEVYAF VRRQAGRPVH AVDGQDSYAG AFLGVGAKDS TAAGRKLRRG
LKTVRIGASF AKQELMGCLA LHRPPEGKPF PAGFVHLPRD VSEDQVKQLT AEELVTHVTR
GRTRREWVPI GGRRNEVLDC ANYARGLAAM RGWDRWREPH WRELEAALGI ERPRPAADEP
VVAPEVAART LAARNQQRRA VRRSRVNNRP MR