Gene M446_2133 details

Gene Information

Locus tag: M446_2133
Symbol:
ID: 6130910
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 2381991
End bp: 2383070
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 78%
IMG OID: 641642361
Product: urea amidolyase related protein
Protein accession: YP_001769029
Protein GI: 170740374
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1984] Allophanate hydrolase subunit 2
TIGRFAM ID: [TIGR00724] biotin-dependent carboxylase uncharacterized domain
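
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon; neither is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2381991
end_bp = 2383070

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and codes for no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the gene length and protein length listed in the record.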


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.335875
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGG GCCTGCGCGT CCTCTCGGCC GGTGCGGGCG CCACCCTGCA GGATGCCGGG 
CGCCACGGCT ACCTGCGCTA CGGCATCACC GCGGCCGGGC CGATGGACCC GCTCGCGCAC
GCCACCGCCA ACCGCGCCCT CGACCGGCCC GCCGGTGCGA CCGCGATCGA AGTCTCGCTC
GCCGGCATCG AGGTGACGGC CGAGGGCGCC CCGCTCGGCG TGGCCCTCGC GGGCGGGGCC
TTCCGCGTCA GCCTCGACGG CGAGCCGGTG CCGCCCGCCG CTTCCCTGAC GCTGCGCCCC
GGCGCCACCC TCGCGCTGCG GGCCGGCCGC GACGGGGCGT GGTGCTACCT CGCGCTCGCC
GGCCGGATCG ACGTGCCGCC GGTGCTCGGC GCCACCGCGA CCCACACCCG CTCGCGGATC
GGGGGCTTCG ACGGCCGCGC CCTGCGGGCG GGCGACCTGC TGCCGGTGCG CGATCCCCGG
CCCGGCCCGG CGGAGATCCA CCGCCTCGCC GCCCCCTGGC TCGACCGGCC GCCCGAGGTG
ATCCGCGTCG TGCCGGGGCC GCAGGACGAT TACTTCGCCC CGGACCAGTT CGCCGCCTTC
CTGGCCGGTC CCTGGACGGT GACGCCGCGG GGCGACCGCA TGGCCTGCTT CCTCGACGGA
GCCCCCCTCG TCCACGCGCG CGGCTTCGAC ATCGTGTCGG ACGGCATCGC CATGGGGGCC
GTGCAGGTGC CCGGCGAGGG CAGGCCGATC GTCCTGATGG CCGACCGGCA ATCCACCGGC
GGCTACCCCA AGATCGCGAC CGTGATCGGG CCGGATCTCG GCCGCCTCGC CCAGGCCCAG
GCCGGCACGC GCCTGCGCTT CCAGGCCGTC TCGGTCGCCG AGGCGGTCGC CGCGCGCCGG
GCCGAGGCCG CGGCCCTGGC GCCGCCGGTC GCCCTGGAGC CCCTGGTGCG GACGCGGTTC
AGCCCGGAAT TCCTGCTCGG GCGCAACCTG ATCGGCGGCG TCGTGGATGG GGGGGCCTCG
GATGGAGGGG CCTCGGACGG GGGAGCCTCG GATGGGGGAG CCTCGGACGG GGGGGCCTGA
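
The GC content field can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch, not the database's own method; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence above.

```python
def clean_sequence(blocked: str) -> str:
    """Strip spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used here as a small example.
first_blocks = "ATGAGCGCGG GCCTGCGCGT"
print(f"{gc_percent(first_blocks):.1f}%")
```

Passing the full 1080 bp gene sequence instead of `first_blocks` gives a value that can be compared with the rounded figure in the GC content field.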
 
Protein sequence
MSAGLRVLSA GAGATLQDAG RHGYLRYGIT AAGPMDPLAH ATANRALDRP AGATAIEVSL 
AGIEVTAEGA PLGVALAGGA FRVSLDGEPV PPAASLTLRP GATLALRAGR DGAWCYLALA
GRIDVPPVLG ATATHTRSRI GGFDGRALRA GDLLPVRDPR PGPAEIHRLA APWLDRPPEV
IRVVPGPQDD YFAPDQFAAF LAGPWTVTPR GDRMACFLDG APLVHARGFD IVSDGIAMGA
VQVPGEGRPI VLMADRQSTG GYPKIATVIG PDLGRLAQAQ AGTRLRFQAV SVAEAVAARR
AEAAALAPPV ALEPLVRTRF SPEFLLGRNL IGGVVDGGAS DGGASDGGAS DGGASDGGA
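
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions: it uses the standard codon assignments (which translation table 11 shares for elongation, differing mainly in the start codons it permits), it reads the sequence in the orientation printed above, and `translate` is a hypothetical helper rather than part of any library.

```python
# Build the standard codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces of the blocked layout
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGCGCGG GCCTGCGCGT CCTCTCGGCC"))
```

The first thirty bases translate to the same ten residues that open the protein sequence above; running the full gene sequence through `translate` allows the same comparison for the entire protein.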