Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_0736 |
Symbol | |
ID | 6130437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 842548 |
End bp | 845475 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641641054 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001767729 |
Protein GI | 170739074 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGATG CGGGCGAGGC GGATCTCGGG GCCGCCGATG CGGGATTCGC GACCACGGCG GCGCATAGCG AGCGCGACGT CGCCCAGGGC GTCACGCGCT GGTTGATCGC CAATTTCCGA TCCGAACTCG CGGACGCGAC GCGATCGACG CCGTTCACGG TGCCGCTGCT GTGCGCCATC GCCTGCCGCG AGGCCGGCAT GTACTGGCTC CCGCTGACGC CGCACAGGGG GGGCGCGGAG ATCCTCGGTC TGTGCGTGTA CGACGCCAGC GGCGACGTGG CCGGCGCGCC CCGAACCGCG TTCCCGATCA ATACCGCTCA GTTCAGACTC ACGTATGGGG ATGCATTCAC GCAGCTCCTG ATCGCCGAAA CGAACAAGGC GAGGGCCGCG CGCGGCCTCA GCCCGGCCTC GATGATTTAC AAAGGTTACG GTATATTTCA GTACGATCTG CAGCATGTCC GCACGGACGA AGCGTTCTTT CGTCAGAAGA AATGGTATGT CTTCGAAGAG TGTGTGAGCA GAGTCGTATC TGAGCTTACA AGCAAGTATG AAGCGACCGG GAACATTCAG GAAGCTGTCC GCGCTTACAA CGGATCTGGC CAGAAAGCCC GTCAATACGC TCAGGATGTG ATGAGATTGC TGCCATATTG TGAGGAGGCG GCGGGTTCTC CACAGATTGC ATCGCTCGCC TCTGCATCGC TCTCCTCTGC GGCGCGCGAT GGCGGGGCGG TCCTCGGCGC GATGGATCCG CAGGACGCCT CGGACGACGA CCCGGGCGCC CCCGCTCCGA CCGACGTCAC CGAGGTCGCG GACGAGGACA CCGCCCGCCT CCTCGCCAAT CTCGGATACT CGGTCGATGC GGAGAGTGCC GCGCCGGCGG AACCGGGCGT CGCGGCCGTC GGCGACACGG CCTTCGACCT CGCGCGGGCG CGGGCCTTCC TCGACGCGTG CCGGACCGCG AGGCCGCGCG TCACCTACGG CCTCGGGCAG AAGGTGCCGT TCCTCGATGC CGTCCCGGGA CGCGACTTCA CGCAGGTCGA TTGCAGCGGT TTCGTCCGAC AGGTCGTCCG GCTCGCCACG ACCCCGTCGC TGCGTTTCCC CGACGGCTCG GTCAATCAGC ACCAATGGGC GCGCGCGAGG GGTTTGGAGA CCTCGTCCGT GGCGGAGGGC AGGGCCACGG ACGACGTGGT CCGGATCGCG TTCCTGCGGC CGCAGGACGC CGGTCGCAAG AGGATCGGGC ACGTCGTCCT GATCGCCAAC GGCGAGACGC TCGAATCCCA CGGCGGCGTG GGTCCGGATT CGCGGCCCTG GACGGGGACC GGCTGGCAGG CGAAGGCCTT CGTCTACGTC CTGGCTCGCG ACGCTCGGAT CCGCGCGGCG GCTTCGGCGG AGCGGCTCGC GGCGGCGCGG ACCGAGTCCG TGGCCAAGCC CAGCATCGTT CGGACCCTCG AGAGCGCGAC GATGCGCCGG ACCACGGCCA GCATCTTCAC CACGCAGGCT CTGCCGCAGC ACCACGACGA CATCCTCGTC GTGACCATGC GGCCCGGACC GGCGGAGGCT CCCGCGGCGG CGGGGATGGC GATGGGGATG GGGATGGCGC CGGCGCCGGA GACGCCCGGC CTGGGTGCGC TGTCGTATTT CGCCCGCGCG GGACGCATCA AGCGGGTCGT TCCGCTGCGC GCGAGCGAGG AGGCGGTGAC GGCGCCGTCC CCGATGGCCG CGGCCGCCGC GATGATGGGG TTCCACCGTC CCGCCGGAGC ACCCGACGTC GGCGCCCCGG TCCGCTTCAT CGAGATGATG GACGGTCAGG ACGCGAAGCA GCTGCACGGC GCTCTGGCGA GCGACCCGAG CGTTCTCTCG GTGTCGCAGG TTCCGGTCCG CTACCTCGCC GCGCGGCGCG CGGGGCGCAC GGCCGCCGGC GGCGGTCTCG GGATCGCCGC GGCGCCGCCC GCGGCATCGC TGCTCTGGAA CCTCGCCAAG ATCCGCTGGC AGGAGGCGCG TGCGGCTGCC GGCTTCCAGG AGGCAACCCG GGTGAGGGTC GCGGTGCTGG ACACCGGCGT CGATGCCAAG CACCCGAGCC TGCGGGTGTC CAATTATTAC TGGCAGAACG CTGATCTGAC GCGGCCCGTC TCGGAGCTCG ATCTGATCGG GCACGGGACG CATGTCTCGG GAACGATCGC TGCGCTGATC GCCAGCGGCG TCTCGGTTCA AGGGGTGTGT GCCTGCCAAC TTGATGTCTG GAAGATCTTC GATGATGAGC CGACCTACGC GCCCGGCCAA GGAGCTTTCG TCTATTACGT GAATCCGATC CTGTACCGTC GTGCCTTGGC CGCGTGCGTG GACGACCCGC CCCACGTCGT GAACCTGAGC ATCGGCGGGC CGGCCGTTCC CGATCCGACC GAGCGGACCC TGTTCGAACA GCTGCTCGCG TCCGGCGTGA CGATCTGCGC GGCGATGGGC AATGATCGCC AGTATGGCAG CCCGACCTCG TACCCGGCCG CGATACCGGG GGTCGTCGCG GTCGGCGCGA CGGGGCTGGA CGACAGGGTG ACGCTCTTCT CGAACAGCGG AAACCACATC GCGGTCGCGG CGCCCGGGAA GGCCATCTGG TCGACGCTCC CCCGGTATGA CGGCCAGACC GCCTTCGGCA TCGCGTACGG CCCGGACGGG CGGCCGCAGC CGGGAGCCAG GGTCCGCCGC GAGTGCAACT ACGACGCCTG GGATGGAACT TCCATGGCAA CACCCCATGT GACGGGGAGC GCGGCGCTCC TGATCGCCAA GAGCATCGCT GCCGGTGGCG AACTCAAGCC CGATCAGGTG AGAGCCGCCC TGATGACGTC GGCCGACAAG GTGCAGGCGA TGAATGGAGC GGATTTCAGC GCCGACTACG GTGCGGGGCG GATCAACCTG CTCAAATTGT TGCAATGA
|
Protein sequence | MTDAGEADLG AADAGFATTA AHSERDVAQG VTRWLIANFR SELADATRST PFTVPLLCAI ACREAGMYWL PLTPHRGGAE ILGLCVYDAS GDVAGAPRTA FPINTAQFRL TYGDAFTQLL IAETNKARAA RGLSPASMIY KGYGIFQYDL QHVRTDEAFF RQKKWYVFEE CVSRVVSELT SKYEATGNIQ EAVRAYNGSG QKARQYAQDV MRLLPYCEEA AGSPQIASLA SASLSSAARD GGAVLGAMDP QDASDDDPGA PAPTDVTEVA DEDTARLLAN LGYSVDAESA APAEPGVAAV GDTAFDLARA RAFLDACRTA RPRVTYGLGQ KVPFLDAVPG RDFTQVDCSG FVRQVVRLAT TPSLRFPDGS VNQHQWARAR GLETSSVAEG RATDDVVRIA FLRPQDAGRK RIGHVVLIAN GETLESHGGV GPDSRPWTGT GWQAKAFVYV LARDARIRAA ASAERLAAAR TESVAKPSIV RTLESATMRR TTASIFTTQA LPQHHDDILV VTMRPGPAEA PAAAGMAMGM GMAPAPETPG LGALSYFARA GRIKRVVPLR ASEEAVTAPS PMAAAAAMMG FHRPAGAPDV GAPVRFIEMM DGQDAKQLHG ALASDPSVLS VSQVPVRYLA ARRAGRTAAG GGLGIAAAPP AASLLWNLAK IRWQEARAAA GFQEATRVRV AVLDTGVDAK HPSLRVSNYY WQNADLTRPV SELDLIGHGT HVSGTIAALI ASGVSVQGVC ACQLDVWKIF DDEPTYAPGQ GAFVYYVNPI LYRRALAACV DDPPHVVNLS IGGPAVPDPT ERTLFEQLLA SGVTICAAMG NDRQYGSPTS YPAAIPGVVA VGATGLDDRV TLFSNSGNHI AVAAPGKAIW STLPRYDGQT AFGIAYGPDG RPQPGARVRR ECNYDAWDGT SMATPHVTGS AALLIAKSIA AGGELKPDQV RAALMTSADK VQAMNGADFS ADYGAGRINL LKLLQ
|
| |