Gene M446_5407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5407 
Symbol 
ID6133730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5940547 
End bp5942127 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content75% 
IMG OID641645541 
Productpeptidase S10 serine carboxypeptidase 
Protein accessionYP_001772157 
Protein GI170743502 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.451508 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGCA TCCTGATGCT GCTCGCCGCG CTCGCCGCGG CAGCGCCCGG CCCCTCGCTC 
GCGCAGCAGC CGCGGCCGCA GGACCCGCGC GCGCAGGAAT CGCGCGCGCC CGAGTCCCGC
CCGTCCGAGT CCCGCCCGTC CGAGTCCCGC CCGCCCGAGG GGCGCAGGCT GCCCCCCGAC
GCGGTGACGC AGCACAGCCT CGCCCTCTCG GACGGGCGCA GCCTCGCCTT CACGGCCACG
GCCGGCAGCC TCGCCCTCGT CGACGAGGCC GGCAAGCTCC AGGCCGAGAT CGCCTTCACG
GCCTTCACCC TGCCGGAGCG CCCGCGGGCG ACGCGGCCCG TCACCTTCGC GCTCAACGGC
GGTCCCGGCG CGGCCTCCGC CTATCTCAAT CTCGGGGCGG TCGGCCCCTG GCGCCTGCCC
CTCGACGGGC CGAGCATCAG CCCCTCGGCG GCGCCGGTGC CGCTGCCCAA CGACGAGACC
TGGCTCGACT TCACCGACCT CGTCTTCCTC GATCCGGTCG GCACCGGCTA CAGCCGGGCG
GCCGGGGACG ACGCCAAGCG CTACTTCTCG GTCGATGCGG ACGCCTCGGT CCTCGCCGCG
GCGATCGCCC GCTGGCTGCG CACCAACGAC CGCCTGACCT CGCCGAAATT CTACCTCGGG
GAGAGCTACG GCGGCTTCCG CGGCCCGCTC ATCGCCCGCA AGCTCCAGGA CGACGTCGGG
GTCGGCCTCT CGGGCCTCGT TCTGCTCTCG CCCGTGCTCG ATTTCGGCTG GCTTCAGCCG
CCGCGGCACA ATCCCCTCGG CGACGTCACC CGGCTGCCCT CCCTGGCCGC CGCCGCCATC
GAGCGCCGCG GCGGCAGCCC GGACCCGGGG GCGCTGGCGG AGGCCGAATC CTACGCCACC
GGCGAGTACC TGAGCGACCT ACTGCGCGGG CCGCGGGACG GCGCGGCGCG CGACCGGCTC
GCCCGCCGGG TCGCGGCGCT CACCGGGCTC GACCCGGACC TGGTGCGGCG GCAGGCCGGG
CGGATCTCGA CCGGCAGCTA CCAGCGCGAG AGCGGCCGGG CGGAGGGGCG CGTCGCCAGC
GCCTACGACA CCGGCGTCAC CGGCTGGGAC CCGGAGCCGA ACGCGGCCCA TGCGGGCTTC
GAGGACCCGC TCCTCTCCGC CATGCAGGCG CCCCTCTCGA GCGCGATCGT CGACCTCACC
GCCCGCACCC TGAACTGGCG GGTGACGAAT CTGCGCTACG AGCTGCTCAG CACCGGCGTG
AACCGCCAGT GGAACTGGGG CTCCGGCCGC ACCCCGCCGG AGGTGGTGAG CGACCTCAGG
CAGGCCCTCG CCCTCGACGG GTCGCTGCGG GTGCTGGTCG CGCACGGCTA CACGGACCTC
GTCACGCCCT ACTTCGCCTC GCGGCTCATC CTCGACCAGA TCCCCGCCTA CGGTCCGGGC
CAACGGCTCA GCTTGGCGGT TTTCCCGGGC GGCCACATGT TCTATTCCCG CCAGGCCTCG
CGGGCGGCGC TCCGGGGCGA GGCGCTGCGC CTCTACGAGG CGGCGCTGGC GGCGCGGCAG
GGAAGCGGCG AGGGACGATG A
 
Protein sequence
MIRILMLLAA LAAAAPGPSL AQQPRPQDPR AQESRAPESR PSESRPSESR PPEGRRLPPD 
AVTQHSLALS DGRSLAFTAT AGSLALVDEA GKLQAEIAFT AFTLPERPRA TRPVTFALNG
GPGAASAYLN LGAVGPWRLP LDGPSISPSA APVPLPNDET WLDFTDLVFL DPVGTGYSRA
AGDDAKRYFS VDADASVLAA AIARWLRTND RLTSPKFYLG ESYGGFRGPL IARKLQDDVG
VGLSGLVLLS PVLDFGWLQP PRHNPLGDVT RLPSLAAAAI ERRGGSPDPG ALAEAESYAT
GEYLSDLLRG PRDGAARDRL ARRVAALTGL DPDLVRRQAG RISTGSYQRE SGRAEGRVAS
AYDTGVTGWD PEPNAAHAGF EDPLLSAMQA PLSSAIVDLT ARTLNWRVTN LRYELLSTGV
NRQWNWGSGR TPPEVVSDLR QALALDGSLR VLVAHGYTDL VTPYFASRLI LDQIPAYGPG
QRLSLAVFPG GHMFYSRQAS RAALRGEALR LYEAALAARQ GSGEGR