Gene M446_5073 details


Gene Information

Locus tag: M446_5073
Symbol:
ID: 6135298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 5563638
End bp: 5564870
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 69%
IMG OID: 641645208
Product: putative substrate-binding protein
Protein accession: YP_001771833
Protein GI: 170743178
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
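
The coordinate and length fields in this record are mutually consistent, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5563638
end_bp = 5564870

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp;", protein_length, "aa")
```

Under these assumptions the computed values match the reported Gene Length (1233 bp) and Protein Length (410 aa).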


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.210468
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGATGA CGCGTCGCGG CGTGCTCGCC GCCAGCGTCG GCGCGAGCCT GACGCCGATG 
CTGCCCCGCC TGCCGCGGGC GCAGGCCGCC GACACGATCC GGATCGGCGT CCTGAACGAT
CAGTCCGGCC CCTACCGGGA CATCAGCGGG CCCTACGGCG TCGCCTGCAC CCAGATGGCG
GTCGACGAGT TCGCCGCGAA CGGCTTCCGG GTCGAGGTGA TCTCCGCCGA CCACCAGAAC
AAGCCCGATG TCGGGGCCAG CATCGCCCGG CAATGGTACG ACCGCGACGG CGTCGACATG
ATCATCGACG TGCCGACTTC CTCGGTCGGC CTCGCGGTGA ACCAGGTCGC CCGCGAGAAG
AACAAGGTCT ACATCAATAC CGGGGCGGCG ACGTCGGACC TCACGGGGGT GCAGTGCACG
CCGGTGACGA TCGCCTGGAT GTACGACACC TACATGCTGG CGAAGTCGAC CGGCGCCGCC
ATGGTGCGGG CGGGGGGCGA CTCGTGGTTC TTCGTCACCG CCGACTACGC CTTCGGGCAC
GCGCTGGAGC GGGACACCAC GGCCTTCATC AAGGCGGCGA ACGGGCGGGT GCTCGGCAGC
GTGCGCTACC CCTTCCCGTC GACCAGCGAC TTCTCCTCCT TCCTGGTCCA GGCCCAGGCG
AGCGGCGCCA AGGTGATCGC CTTCGCCAAT GCGGGCGCCG ACACGGTCAA CTGCATCAAG
CAGGCGGCGG AGTTCGGCGT GACCCAACAC GGCACGAAGC TCGCGGCCCT GCTGATGTTC
CTCTCCGACG TGCACGCGCT CGGGCTGCAG GCCGCGCAGG GCCTGATCCT GACCGAGAGC
TTCTACTGGG ACCTCAACGA CCGCACCCGC GCGGTGACGA GGCGCGCCCT GCCGCGGTTG
AGCGGCGTCT ACCCGAACAT GGCCTCGATC GCCGATTACT CCGCGACGCT GCACTACCTC
AAGGCGGTGG CCGACATGGG GGTGAAGGCC GCCAAGGCCT CGGGCGTCGA CGTGGTGAAC
CGCATGAAGG CGATGCCGAC GGACGACGAC GCCTTCGGAC CCGGGCGCAT CCGCGAGGAC
GGCCGCAAGC TCTGCCCCTC CTACCTGTTC GAGGTGAAGA CGCCCGCCGA GAGCAGCAAG
CCCTGGGATT ACTACAAGCT CATCGGCACC ACCCCGGCCG AGGAGGCGTT CCGCCCGCTC
GACAAGGGCA ATTGCCCGCT CGTGAAGGCG TGA
 
Protein sequence
MRMTRRGVLA ASVGASLTPM LPRLPRAQAA DTIRIGVLND QSGPYRDISG PYGVACTQMA 
VDEFAANGFR VEVISADHQN KPDVGASIAR QWYDRDGVDM IIDVPTSSVG LAVNQVAREK
NKVYINTGAA TSDLTGVQCT PVTIAWMYDT YMLAKSTGAA MVRAGGDSWF FVTADYAFGH
ALERDTTAFI KAANGRVLGS VRYPFPSTSD FSSFLVQAQA SGAKVIAFAN AGADTVNCIK
QAAEFGVTQH GTKLAALLMF LSDVHALGLQ AAQGLILTES FYWDLNDRTR AVTRRALPRL
SGVYPNMASI ADYSATLHYL KAVADMGVKA AKASGVDVVN RMKAMPTDDD AFGPGRIRED
GRKLCPSYLF EVKTPAESSK PWDYYKLIGT TPAEEAFRPL DKGNCPLVKA
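
The relationship between the gene sequence and the protein sequence above can be spot-checked with a minimal translation sketch. It is not the database's own pipeline. The codon table is built from the sense-codon assignments that translation table 11 shares with the standard code, and the helper name translate is hypothetical. Only the first and last blocks of the gene sequence are translated here.

```python
from itertools import product

# Amino acids in NCBI order (bases T, C, A, G at each codon position).
# Table 11 uses the same assignments as the standard code; it differs
# only in the set of permitted start codons, which is not modelled here.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in tens.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides and last 15 nucleotides of the gene sequence above.
first_block = translate("ATGAGGATGA CGCGTCGCGG CGTGCTCGCC")
last_block = translate("CTCGTGAAGG CGTGA")
print(first_block, last_block)
```

The two fragments translate to MRMTRRGVLA and LVKA, which agree with the start and end of the listed protein sequence. The terminal TGA is the stop codon.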