Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_5407 |
Symbol | |
ID | 6133730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 5940547 |
End bp | 5942127 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641645541 |
Product | peptidase S10 serine carboxypeptidase |
Protein accession | YP_001772157 |
Protein GI | 170743502 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.451508 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGCA TCCTGATGCT GCTCGCCGCG CTCGCCGCGG CAGCGCCCGG CCCCTCGCTC GCGCAGCAGC CGCGGCCGCA GGACCCGCGC GCGCAGGAAT CGCGCGCGCC CGAGTCCCGC CCGTCCGAGT CCCGCCCGTC CGAGTCCCGC CCGCCCGAGG GGCGCAGGCT GCCCCCCGAC GCGGTGACGC AGCACAGCCT CGCCCTCTCG GACGGGCGCA GCCTCGCCTT CACGGCCACG GCCGGCAGCC TCGCCCTCGT CGACGAGGCC GGCAAGCTCC AGGCCGAGAT CGCCTTCACG GCCTTCACCC TGCCGGAGCG CCCGCGGGCG ACGCGGCCCG TCACCTTCGC GCTCAACGGC GGTCCCGGCG CGGCCTCCGC CTATCTCAAT CTCGGGGCGG TCGGCCCCTG GCGCCTGCCC CTCGACGGGC CGAGCATCAG CCCCTCGGCG GCGCCGGTGC CGCTGCCCAA CGACGAGACC TGGCTCGACT TCACCGACCT CGTCTTCCTC GATCCGGTCG GCACCGGCTA CAGCCGGGCG GCCGGGGACG ACGCCAAGCG CTACTTCTCG GTCGATGCGG ACGCCTCGGT CCTCGCCGCG GCGATCGCCC GCTGGCTGCG CACCAACGAC CGCCTGACCT CGCCGAAATT CTACCTCGGG GAGAGCTACG GCGGCTTCCG CGGCCCGCTC ATCGCCCGCA AGCTCCAGGA CGACGTCGGG GTCGGCCTCT CGGGCCTCGT TCTGCTCTCG CCCGTGCTCG ATTTCGGCTG GCTTCAGCCG CCGCGGCACA ATCCCCTCGG CGACGTCACC CGGCTGCCCT CCCTGGCCGC CGCCGCCATC GAGCGCCGCG GCGGCAGCCC GGACCCGGGG GCGCTGGCGG AGGCCGAATC CTACGCCACC GGCGAGTACC TGAGCGACCT ACTGCGCGGG CCGCGGGACG GCGCGGCGCG CGACCGGCTC GCCCGCCGGG TCGCGGCGCT CACCGGGCTC GACCCGGACC TGGTGCGGCG GCAGGCCGGG CGGATCTCGA CCGGCAGCTA CCAGCGCGAG AGCGGCCGGG CGGAGGGGCG CGTCGCCAGC GCCTACGACA CCGGCGTCAC CGGCTGGGAC CCGGAGCCGA ACGCGGCCCA TGCGGGCTTC GAGGACCCGC TCCTCTCCGC CATGCAGGCG CCCCTCTCGA GCGCGATCGT CGACCTCACC GCCCGCACCC TGAACTGGCG GGTGACGAAT CTGCGCTACG AGCTGCTCAG CACCGGCGTG AACCGCCAGT GGAACTGGGG CTCCGGCCGC ACCCCGCCGG AGGTGGTGAG CGACCTCAGG CAGGCCCTCG CCCTCGACGG GTCGCTGCGG GTGCTGGTCG CGCACGGCTA CACGGACCTC GTCACGCCCT ACTTCGCCTC GCGGCTCATC CTCGACCAGA TCCCCGCCTA CGGTCCGGGC CAACGGCTCA GCTTGGCGGT TTTCCCGGGC GGCCACATGT TCTATTCCCG CCAGGCCTCG CGGGCGGCGC TCCGGGGCGA GGCGCTGCGC CTCTACGAGG CGGCGCTGGC GGCGCGGCAG GGAAGCGGCG AGGGACGATG A
|
Protein sequence | MIRILMLLAA LAAAAPGPSL AQQPRPQDPR AQESRAPESR PSESRPSESR PPEGRRLPPD AVTQHSLALS DGRSLAFTAT AGSLALVDEA GKLQAEIAFT AFTLPERPRA TRPVTFALNG GPGAASAYLN LGAVGPWRLP LDGPSISPSA APVPLPNDET WLDFTDLVFL DPVGTGYSRA AGDDAKRYFS VDADASVLAA AIARWLRTND RLTSPKFYLG ESYGGFRGPL IARKLQDDVG VGLSGLVLLS PVLDFGWLQP PRHNPLGDVT RLPSLAAAAI ERRGGSPDPG ALAEAESYAT GEYLSDLLRG PRDGAARDRL ARRVAALTGL DPDLVRRQAG RISTGSYQRE SGRAEGRVAS AYDTGVTGWD PEPNAAHAGF EDPLLSAMQA PLSSAIVDLT ARTLNWRVTN LRYELLSTGV NRQWNWGSGR TPPEVVSDLR QALALDGSLR VLVAHGYTDL VTPYFASRLI LDQIPAYGPG QRLSLAVFPG GHMFYSRQAS RAALRGEALR LYEAALAARQ GSGEGR
|
| |