Gene Mnod_3421 details

Gene Information

Locus tag: Mnod_3421
Symbol:
ID: 7308786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 3541119
End bp: 3542339
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 67%
IMG OID: 643601095
Product: ABC transporter substrate-binding protein
Protein accession: YP_002498639
Protein GI: 220923337
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
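
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the record lists the gene as unspliced). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3541119
END_BP = 3542339
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def span_length(start, end):
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1221
print(expected_protein_length(GENE_LENGTH_BP))  # 406
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields of the record.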


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAAGGCGC GGTCACGCTT CTGTGCCCTG ATGGCTCTGA TGCTGGGGTC GAGTGCAGTC 
CAGGCCCAGA TCTCCGACAA CGTGGTCAAG ATCGGCGTGC TCTCCGATAT GAGCGCCGGC
CAATCCGACA GCACGGGACC GGGATCGGTG GTGGCGGCCC GCATGGCTGT CGAGGATTTC
GGCGGAAAGG TGCTGGACCA GCCCATCGAG GTCGTCTCGG CCGATCACCA GAACAAACCC
GATGTGGGTT CGAACATCGT CCGGCAATGG CTGGAGCGAC AGCAGGTCGA CGTCGTCGCC
GACGTCCCGA CCTCGTCGGT TGCGCTCGCG GTCCAGACGC TCACGCGGGA GCGCGACCGC
ATCTTCCTGA ACTCCTCGGC AGGCTCGTCC GACCTGTCCG GACCGGCCTG CTCGCCCACG
GCGATCCACT GGACCTACGA CACCTACTCC CTGGCCAATG GGACGGCCGG TCCCCTCGTC
AGCCAAGGCG CGGATACGTG GTACTTCATC ACGGCCGACT ACGCCTTCGG CCATGCCCTC
GAACGCGACA CGAGCCAGGC CGTGACGCGG AACGGCGGCA AGGTCTCGGG CACCGTGCGG
CATCCTATGG GCATGGCCGA CTTCTCCTCG CCCCTGCTGC AGGCGCAGGC CTCGCAGGCG
AAGGTGATCG CACTGGCCGA TCCCGTCGGC GACACCGCCA CGGCGGCCAA GCAGGCCGGC
GAGTTCGGCA TCCAGGTGCA GGGCCAGAAG CTCGTGGGCC TGCTCATCGA CGTCGTCGAC
CTGCGGGCGA TCGGGCTTCC CATCGCCCAG GGCATGCTGC TGACGACCTC GTTCTACTGG
GACCGGGACG ACGAGACCCG AGCCTTCGCG AAGCGCTTCT TTGACCGCCA CAAGCGCATG
CCGACCCAGT TCCAGGCCGG CGTGTACTCG AGCATCATGC ACTACCTCAA GGCCGTGCAG
GCGGCAGGAA CCGACGAGGC GAAGGCCGTC GTCGCGAAGA TGCGGGAGAT GCCGGTCAAC
GACTTCTTTG CCCGGAACGG CAGGCTGCGC GAGGACGGTC GCATGGTTCA CGACATGTAC
CTCATGCAGG TCAAATCGCC GGCCGAGTCG AAGGGCGAGT GGGATCTGCT CAAGCTCGTG
CAGACGATCC CGGGCGAGCG GGCCTTTCGC CCACTCGATG CCGGCGGCTG CCCCTTGGTC
GCCAAGGACC GGAAAGACTA G
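
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. Only the first line of the sequence is used here for brevity, so the printed percentage describes that line alone and not the whole gene. The helper names are hypothetical.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGAAGGCGC GGTCACGCTT CTGTGCCCTG ATGGCTCTGA TGCTGGGGTC GAGTGCAGTC"

seq = clean_sequence(first_line)
print(len(seq))                     # 60
print(f"{gc_fraction(seq):.0%}")    # GC percentage of this first line only
```

Passing the full 1221 bp sequence through the same two functions would give a value to compare with the reported GC content.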
 
Protein sequence
MKARSRFCAL MALMLGSSAV QAQISDNVVK IGVLSDMSAG QSDSTGPGSV VAARMAVEDF 
GGKVLDQPIE VVSADHQNKP DVGSNIVRQW LERQQVDVVA DVPTSSVALA VQTLTRERDR
IFLNSSAGSS DLSGPACSPT AIHWTYDTYS LANGTAGPLV SQGADTWYFI TADYAFGHAL
ERDTSQAVTR NGGKVSGTVR HPMGMADFSS PLLQAQASQA KVIALADPVG DTATAAKQAG
EFGIQVQGQK LVGLLIDVVD LRAIGLPIAQ GMLLTTSFYW DRDDETRAFA KRFFDRHKRM
PTQFQAGVYS SIMHYLKAVQ AAGTDEAKAV VAKMREMPVN DFFARNGRLR EDGRMVHDMY
LMQVKSPAES KGEWDLLKLV QTIPGERAFR PLDAGGCPLV AKDRKD
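
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first line of the gene. This is a minimal sketch, not the database's own method. It builds the codon table from the standard NCBI amino acid string, which translation table 11 shares with table 1 for elongation codons. Table 11 also allows alternative start codons that are read as M at the first position; that case is not handled here, since this gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from this record.
first_line = "ATGAAGGCGC GGTCACGCTT CTGTGCCCTG ATGGCTCTGA TGCTGGGGTC GAGTGCAGTC"
print(translate(first_line))  # MKARSRFCALMALMLGSSAV
```

The 20 residues produced match the first two blocks of the protein sequence listed above.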