Gene M446_5033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5033 
Symbol 
ID6129475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5514099 
End bp5515229 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content75% 
IMG OID641645169 
Productpeptidase dimerisation domain-containing protein 
Protein accessionYP_001771794 
Protein GI170743139 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.379056 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00853617 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCCCCG AATCCGCCGC CCCGTCGCCC GAGGAGGCCG TCGCCGCGAT CAGCCGCTGG 
CTCTCGGTCG AGAGCCCGAC CCACCACGCG GCCGGGGTCA ACCGGATGAT GGACCTCGTC
GCCGACGAGG CCGAGGCGAC CGGCATCCCG TGGGAGCGGA TCGGCGGCAC GCAGGGCCTC
GGCGACAGCC TGATCCTGCG GGCCGGGCCG CGGACCGGGG AGCCCGCCCT CCTGGTCCTG
TCGCACCTCG ACACGGTCCA TCCGGTCGGC ACCCTGGCGG AGCTGCCGGT GCGGGTCGAG
GGCGACCGGC TCTACGGGCC GGGCGTGTAC GACATGAAGG GCGGGGCGTG GCTCTGCCTG
CAGGGCTTCA TCGCCGCGGC GAAGGGCGGG CAGGCCCGGC GGCCCCTCGT CTTCCTGTTC
ACGAGCGACG AGGAGATCGG CTCGCCGACG ACCCGCGGGC TGATCGAGGA TCTGGGGCGG
CGGGCCGAGG CGGTGCTGGT GACCGAGCCC GGCCGGGACG GCGGCCGGGT GGTCACGGGC
CGCAAGGGCG TCGGGCGCTT CGACATCCAC GTGGAGGGGC GCCCCGCCCA TGCCGGTAGC
CGCCACGCGG AGGGGCGCAA CGCGATCCGC GAGGCCGCCC GGCTGATCCT GGAGATCGAG
GCCCTGACCG ACTACGCGCG CGGCATCACC ACCACGGTCG GGCTGGTCCA GGGCGGCACC
GCCGAGAACG TGGTGCCGCA GCATTGCCGC TTCACCGCGG ACCTGCGGGT GGTGACGGAG
GAGGACGGGC GGGCCTGCGT GGCGCGCCTC CGCGGCCTGC AGGCCGCGCC CGACTTCACC
GTGACGGTGA CCGGCGGCAT GAACCGCCCG CCCTATCCGC GCTCGGACCT GACCGGCCGG
CTCTTCGCGC AGGCGCGCGC CATCGCCGAG CAGGAGCTCG GCCTCGCCCT CGGCGAGGTG
CCGCTGACGG GCGGCGGCTC GGACGGGAAC TTCACGGCGG CGCTCGGCGT GCCGACCCTC
GACGGCCTCG GCATCGACGG GGACGGCGCC CACACGCTGT GGGAGTACGG CCTGATCTCC
TCCATCGCGC CGCGGCGGCG GCTGATGCAG CGGATGCTGG AGACGCTGTG A
 
Protein sequence
MSPESAAPSP EEAVAAISRW LSVESPTHHA AGVNRMMDLV ADEAEATGIP WERIGGTQGL 
GDSLILRAGP RTGEPALLVL SHLDTVHPVG TLAELPVRVE GDRLYGPGVY DMKGGAWLCL
QGFIAAAKGG QARRPLVFLF TSDEEIGSPT TRGLIEDLGR RAEAVLVTEP GRDGGRVVTG
RKGVGRFDIH VEGRPAHAGS RHAEGRNAIR EAARLILEIE ALTDYARGIT TTVGLVQGGT
AENVVPQHCR FTADLRVVTE EDGRACVARL RGLQAAPDFT VTVTGGMNRP PYPRSDLTGR
LFAQARAIAE QELGLALGEV PLTGGGSDGN FTAALGVPTL DGLGIDGDGA HTLWEYGLIS
SIAPRRRLMQ RMLETL