Gene M446_3931 details

Gene Information

Locus tag: M446_3931
Symbol:
ID: 6130250
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4379706
End bp: 4381205
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 76%
IMG OID: 641644089
Product: carboxypeptidase Taq
Protein accession: YP_001770731
Protein GI: 170742076
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2317] Zn-dependent carboxypeptidase
TIGRFAM ID:
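
The coordinate and length fields above agree with one another, and the arithmetic can be checked with a minimal sketch. It assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon; the record states neither convention explicitly, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4379706, 4381205)
print(length)                  # 1500, matching "Gene Length"
print(protein_length(length))  # 499, matching "Protein Length"
```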


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.123497
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0158705
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGCAT ATCGCTCCCT CGCGGAGGGT TTCGGGCGGA TCGCGGCGCT GGAGGGCGCC 
GCGGGCATCC TCGACTGGGA TTCCCGCACC CAGATGCCGG ACGGGGCCGC GGAGGGCCGC
GCCGACCAGC TCGCCGCCCT GCAGGGGCTG ATCCACGACC TCCTGACCGC CCCGGCCCTC
GACGAGGAGC TCGCCCGGGC CGAGGAGGAG CCCCTCGACC CCGGTGCGCG GGCCAACCTG
CGCGAGATGC GCCGGGTGCG CCGGCACGCC GCGGCGGTCC CGCGCGACCT CGTGGAGGCC
AATGCCCGCG CGGTGATGCG GGCCGAGATG GTGTGGCGGG AGGCGCGGGC GCGGTCGGAT
TTCGCCCTGC TCGCGCCCCA CCTCGCCGAG GTGCTGCGGC TCCAGCGCGA AATCGGCGCG
GCCAAGGGCG CGGCCCTCGG CCTCTCGCCC TACGACGCGC TCCTCGACGC CTACGACCCG
GGCCTGCGGC GCGGCCTGAT CGAGCCGCTC TTCGCCGAGC TGCGCGCCGT CCTGCCGGGG
CTGATCGAGG CCGTCCGCGC GCGCCAGGCG GCGGCCCCGC CCGCGCTGCC GCTCCCGGGC
CCCTTCCCGG TGGAGACCCA GCGGGCGGTG GGGCTCGCGC TGATGGGGGC CGCGGGCTTC
GACTTCCGGC GCGGGCGCCT CGACGTCAGC CTGCACCCGT TCTGCGGCGG CGCCGCCGAC
GACGTGCGCA TCACCACCCG CTACGACGAG CGCGACGTGA TGCGGGCGCT GATGGGCGTG
CTGCACGAGA CCGGGCACGC GCTCTACGAA CAGGGCCGAC CGGCGGCGTG GCGGGGCCAG
CCGGCCGGGC AGGCCCGCGG CATGAGCCTG CACGAGAGCC AGTCGCTGAT CATCGAGATG
CAGGCCGGCC GCTCGCCCGA ATTCCTGAGC TTCCTCGCCC CCCTGCTCGG CCGGCACTTC
GCCGGCGAGG GGCCCGCCTG GAGCGCCGCG AACCTGACCC GCCTCGCGAC CGCGGTCTCG
CCCGGCCTCA TCCGGGTCGA TGCCGACGAG GTCACCTATC CGGCCCATAT CCTGCTGCGG
ACGGAGCTGG AGATCGCGAT GATCGCGGGC GACCTCGCGG TCGCCGACCT GCCCGAGGCC
TTCGCCGCCG GAATGCGCGA CCTCCTCGGC CTCGCGGTGC CGAACGATGC CCTCGGCTGC
CTGCAGGACA TCCACTGGCC GGGCGGCTCC TTCGGCTATT TCCCGACCTA CACGCTCGGG
GCGATGATGG CCGCGCAGCT CTTCGCGGCG GCCTGCGCGG CGGAACCCGG GATCCGGCCC
GGCCTCGCGC GGGGGGATTT CGCGCCCCTC GTCGGCTGGC TGCGCCGCCA CGTGCACGAG
CGGGCGAGCC TCCTCGACAC GCAGGACCTC CTCGTGGCGG CGACCGGCCA GCCCCTCTCC
AGCGCGCCGT TCCTGGGCCA CCTGCGGCAG CGCTATCTCG GGGAGGCGGA CGCCGGGTGA
 
Protein sequence
MDAYRSLAEG FGRIAALEGA AGILDWDSRT QMPDGAAEGR ADQLAALQGL IHDLLTAPAL 
DEELARAEEE PLDPGARANL REMRRVRRHA AAVPRDLVEA NARAVMRAEM VWREARARSD
FALLAPHLAE VLRLQREIGA AKGAALGLSP YDALLDAYDP GLRRGLIEPL FAELRAVLPG
LIEAVRARQA AAPPALPLPG PFPVETQRAV GLALMGAAGF DFRRGRLDVS LHPFCGGAAD
DVRITTRYDE RDVMRALMGV LHETGHALYE QGRPAAWRGQ PAGQARGMSL HESQSLIIEM
QAGRSPEFLS FLAPLLGRHF AGEGPAWSAA NLTRLATAVS PGLIRVDADE VTYPAHILLR
TELEIAMIAG DLAVADLPEA FAAGMRDLLG LAVPNDALGC LQDIHWPGGS FGYFPTYTLG
AMMAAQLFAA ACAAEPGIRP GLARGDFAPL VGWLRRHVHE RASLLDTQDL LVAATGQPLS
SAPFLGHLRQ RYLGEADAG
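
The sequences in this section are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to recompute a GC fraction comparable to the "GC content" field. The helper names are hypothetical, and only the first and last lines of the gene sequence are used here for brevity, so the printed fraction is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence as printed in this record.
excerpt = """
ATGGACGCAT ATCGCTCCCT CGCGGAGGGT TTCGGGCGGA TCGCGGCGCT GGAGGGCGCC
AGCGCGCCGT TCCTGGGCCA CCTGCGGCAG CGCTATCTCG GGGAGGCGGA CGCCGGGTGA
"""

seq = clean_sequence(excerpt)
print(len(seq))
print(f"GC fraction of excerpt: {gc_fraction(seq):.2f}")
print(seq[:3], seq[-3:])  # first and last codon of the excerpt
```

Passing the full gene sequence through the same two helpers would give a figure to compare against the reported GC content.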