Gene Mnod_4043 details


Gene Information

Locus tag: Mnod_4043
Symbol:
ID: 7303420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 4112899
End bp: 4114020
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 62%
IMG OID: 643601695
Product: transcriptional regulator, LacI family
Protein accession: YP_002499225
Protein GI: 220923923
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
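
The gene and protein lengths listed above follow from the start and end coordinates. The Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, which is consistent with the reported gene length, and that the final codon is a stop codon. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 4112899
end_bp = 4114020

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1122 0 373
```
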


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCGCC CTACCATCGC CGATGTGGCC AAGGCCGCCA ATGTCAGCGT GTCAACGGTC 
GATCGTGTCC TGAGCGGCCG CCATTCCGTG CGGGAGGCAA CCGCCGAACG GGTGCAACGT
GCCGCGGAGG CCGTCGGCTT TCATCTCGCT GGGACGATCC GTCACCGCCT CGGACAGGAT
CGGCCTGCCC GCACCCTCGG CTTCCTCCTC CAGCAGCGGC AAAACGAGTT TTACGAGACA
CTCGGGCAGG TCCTGCAGGA GGCAACGGAT GCCTCGACGA CCATCAGGGG CCAGGCGGTG
GTGAGGTTCC GGGACTACCA GGAGGAGGAG GTCGCCGCGG AATGCCTCCT CCAGCTTGGG
CGGGAATGCA ATGCCGTGGC GGTCGTCGCC GCAGATCACC CCAAGGTGAC GCAGGCGATC
GACACTCTGC ACGAGGAAGG CGTGCCTGTA TTCGCGCTCA TTTCCGAGCT CACGGCGGCG
AACAGTGCCG GCTACGTAGG GCTGGACAAC TGGCAGGTCG GCGGCACCGC GGCTTGGTTC
CTCTCCAACA TGTGCAGGAC ACCGGGAAAG ATCGGCATCT GTGTGGGGAG CCTTTGCTTC
CAACGTGCCA GCGAAATGCG CTTCCGTTCG TACTTCCGCG AACGTGCACC AGAGTTTCAG
TTACTCGATT CCACGGTGAC CCTCGACGAC GAGCGCTACG CCTATGAATG CACCCGGGAT
TTGTTGCGGC AGACGCCGGA TCTGGTTGGC ATTTACGTCG CCGGTGGCAG CATCACCGGC
GTCATCCGAG CCTTGCGTGA ACTCCCGAGC GCCGCATCCC GAAACCTCGT GGTTATCGGA
CGGGAACTGA TACCTGATAC CATGAGAGGG CTCATCGAGG GTTTGATCAA TGTTGTCCTG
TCGCATCCGA AGAAATTGCT GGCCGAGACG CTGGTCGAGG CGATGGCGCA GTCAACGATC
AGCAACCAAG GCGGCAGCTA CGTGCACCCT CCCATTCCCT TCGATATCTA CGCGCCCGAA
AATCTTTGGG CATTTCGATT ATCTGACTTG CGGGGAATGT ATGAGCCGGG GTGGCCATGG
CGGCCGAACA TAGCACGGAG CCGGAAATTG AGTGGCGTGT GA
 
Protein sequence
MPRPTIADVA KAANVSVSTV DRVLSGRHSV REATAERVQR AAEAVGFHLA GTIRHRLGQD 
RPARTLGFLL QQRQNEFYET LGQVLQEATD ASTTIRGQAV VRFRDYQEEE VAAECLLQLG
RECNAVAVVA ADHPKVTQAI DTLHEEGVPV FALISELTAA NSAGYVGLDN WQVGGTAAWF
LSNMCRTPGK IGICVGSLCF QRASEMRFRS YFRERAPEFQ LLDSTVTLDD ERYAYECTRD
LLRQTPDLVG IYVAGGSITG VIRALRELPS AASRNLVVIG RELIPDTMRG LIEGLINVVL
SHPKKLLAET LVEAMAQSTI SNQGGSYVHP PIPFDIYAPE NLWAFRLSDL RGMYEPGWPW
RPNIARSRKL SGV
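
The protein sequence can be cross-checked against the gene sequence by translating the DNA codon by codon. The Python sketch below is one minimal way to do that, using only the standard library. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table holds the standard amino acid assignments, which translation table 11 shares with the standard code; table 11 differs only in its permitted start codons, which does not matter here because the gene starts with ATG. Only the first and last rows of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA in frame 1; a stop codon is rendered as '*'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGCCCCGCC CTACCATCGC CGATGTGGCC AAGGCCGCCA ATGTCAGCGT GTCAACGGTC"
last_row = "CGGCCGAACA TAGCACGGAG CCGGAAATTG AGTGGCGTGT GA"

print(translate(first_row))  # MPRPTIADVAKAANVSVSTV
print(translate(last_row))   # RPNIARSRKLSGV*
```

The two results match the first 20 and last 13 residues of the protein sequence, with the trailing `*` marking the TGA stop codon. The last row is in frame only because the 1080 bases before it are a multiple of three. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content listed in the record.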