Gene Mnod_2023 details


Gene Information

Locus tag: Mnod_2023
Symbol:
ID: 7305212
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 2122280
End bp: 2123527
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 66%
IMG OID: 643599757
Product: Extracellular ligand-binding receptor
Protein accession: YP_002497312
Protein GI: 220922011
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
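
The coordinates and lengths in this record are mutually consistent, and the relationship can be checked with a little arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and an unspliced coding sequence that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2122280, 2123527)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1248 bp and 415 aa, matching the listed Gene Length and Protein Length.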


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGAAC AGGGAACGCG CCAGGGCTCC GTCTCGCGCC GGATGCTCGT GCGCGGCATG 
GCCGCCACTG GCGCGCTCGC CGGCATCGGC ATGCCTTTCG TCGCGCGCGC GGCTGAGCCG
ATCCGCATCG GCTTCCCGAC GCCGGTCACC GGCCCGTTCG GCGCCGAAGC CAAGGATCAA
ATCCGCTCCG CCGAACTCGC CGTGAAGCAG TTCAACGAAG CGGGTGGCGT GAACGGACGG
ACGGCCGAGC TTCTAGTCCG CGACGACAAG CTCAATCCCG GCGAGGCCGC GACCCGGACG
CTAGAGCTCA TCGAGAAGGA CAAAGTCCAC TTCATCGTAG GCGCGCTTTC GAGCGCGGTC
CAGCTTTCCG TCAACGAGAT CACCCGCTCG CGCAAGGTCC TCTATGTGTC GATCAGCCAG
TCTGACACGA TCAACGAGGC CAAAGACTTC AGCCGCTACA CCTTCCACGA GGCGCTGAAC
CCGCACATGA CCACCGCGGC GGTGGCCAAG CACGCGTTCA AGAAGGGCAC CAAGGTCGCA
TACCTGGCCG CCGACTATGC CTACGGCCAC GAGATGCTGC GCGGCTTCAA GCGCGCGGCG
GCCGCCATTG GCGCCGAGAC GGTCGGCGAG ATCCTGCACC CGTTCGGCGC GCCCGACTAC
TCGACCTTCA TGCCTCGGCT GCGCTCCATG CGCCCCGACA TCCTGTGCAT CTGCAATTTC
GGCCGCGATC AGGCCAATAG CATCAAGCAG GCCAGCGATT TCGGGTTGAA GAAGGGCGCC
CAGATCGTCG TCCCGGTTCT GCTGCACAAC CAACGCCTCG CCGGCGGCGC CGACGCCTTC
GAAGGCGTGG TAGGGGCCAG CAACTACTAC TGGCGCCTTG AGGAGACCGT CCCGTCGGCA
AAAGCCTTCA ACGACGCCTT CCGGGCCGCC TACGCCGACG CGATCCCGAC CGATTACGGC
GCCTACGGTT ACACCGCCGT TCGCTCACTG CTGATGGCGG TGAAAGCGGC CGGCGACACC
GACACCGACA AGGTCATCGC GGCGTTGGAG GGACTGACAT ACGACGTCGC CAAGGGCCCG
GAGCGCTACC GCGCCTGCGA CCACCAGGCG ATCCAGTCCG TGCTCATCAC CGTATCCAAG
AAAAAGTCCG AGATGCAGGG CGAGGCGGAC CTTTTCCGGA TCCTGGAGGT CGAGGCGGGC
TCCGAGAACG CGCTCCGCAC CTGCAACGAA CTCGGTCACC GCGCCTGA
 
Protein sequence
MIEQGTRQGS VSRRMLVRGM AATGALAGIG MPFVARAAEP IRIGFPTPVT GPFGAEAKDQ 
IRSAELAVKQ FNEAGGVNGR TAELLVRDDK LNPGEAATRT LELIEKDKVH FIVGALSSAV
QLSVNEITRS RKVLYVSISQ SDTINEAKDF SRYTFHEALN PHMTTAAVAK HAFKKGTKVA
YLAADYAYGH EMLRGFKRAA AAIGAETVGE ILHPFGAPDY STFMPRLRSM RPDILCICNF
GRDQANSIKQ ASDFGLKKGA QIVVPVLLHN QRLAGGADAF EGVVGASNYY WRLEETVPSA
KAFNDAFRAA YADAIPTDYG AYGYTAVRSL LMAVKAAGDT DTDKVIAALE GLTYDVAKGP
ERYRACDHQA IQSVLITVSK KKSEMQGEAD LFRILEVEAG SENALRTCNE LGHRA
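
The protein sequence can be reproduced from the gene sequence by codon-wise translation, and a GC fraction can be obtained from a simple base count. The following is a minimal Python sketch, assuming the standard codon assignments (translation table 11 shares these with the standard code; the two differ in which start codons are permitted) and assuming the spaces and line breaks in the listing are formatting only. The helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons are not handled specially; this gene
    starts with ATG, so that does not matter here.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence listing.
first_row = "ATGATCGAAC AGGGAACGCG CCAGGGCTCC GTCTCGCGCC GGATGCTCGT GCGCGGCATG"
print(translate(first_row))  # MIEQGTRQGSVSRRMLVRGM
```

The printed residues match the first 20 residues of the protein sequence listing. Passing the full gene sequence to `gc_fraction` gives the value to compare with the GC content field.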