Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_1300 |
Symbol | |
ID | 6134543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 1429383 |
End bp | 1430441 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641641581 |
Product | hypothetical protein |
Protein accession | YP_001768252 |
Protein GI | 170739597 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3591] V8-like Glu-specific endopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.457596 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCCGA TCCTCAACTA TGGTCCGATC CTCCCCTCCG CAGAGCACCG GATCGAGGGG TCGGACATGA ATCTGGACAA TTCGGGCCCG CCGCAGCCCG CCCGGGCGCG GCGCGAGGAG AGGCCGCTGA CCTGGGACGA GTTGCGCGAC CGGGATCCGT GCATCGAGGC GAGCGAGCCG GTGCCGGAGA CCCTGCGCCA CCTCTGGTCC GACCTCTCCG TCGCCAGGGC GGTCCCGGAC CAGACGACGC GCAAGGGGAT GGGCAGCCCG CGCGAGCGCA TCGCGCCGCA TCGCCCGGCC TGGGTCTCGT CCGCCGTGAG GCCGGTGCAG GCACCCCGAA GGCTGCCGCC CCGCCTGGTC CATCGCGGGA TCCCGGTCGA CCCGCTCGTC GTCTGGGGGG AGGACGACCG GCGGAGCTAC GACGACACCC GGTACCCGTG GGGCTGCGTG TGCAAGATCC TGAGTTCCGG GAAGACCGGG TCGGGCGTGC TCGTCGGCCC GCGCCACGTC CTGACGGCGA GCCACGTGGT GAATTGGGGC GGCACCGCCG AGACGATCGA GGTGCACCGG GCCGGCGCGA CCGCCGCGGC GACCGCCAGG ACCGTCCGGC GGTGGACCTT CACCAAGATC ACCGGCGATC CCGGCGCCAG CACGGTCGAC GAGGATTACG CGGTCCTGGT GGTCGATCAG CGCCTGGGCG ACCGCTTCGG CTGGATGGGC GTGCGCACCT ACGACAGCGC CTGGGACGAG GAGGATTGGT GGTGGAACAT CGGCTATCCG GACGACGTGT CGGCGGGCCT GTTCCCGATC TACCAGCGCA ACAAGAAGCT CGACGAGGAC GCGTGGGATT ACGGCTCCGG CCGGGCGATG ACGACCGCGG CCGACCTGAT GCCGGGCCAG TCCGGCGGGC CGATGTTCGG GTTCTGGTCC GACGGTCCCT TCGTGGTCGC GGTCGTGTCG GCCGTCGGGA ACGTCTTTCT GACCGGGACC GAGAATTACT GCTCGGGCGG GTCCGATCTC ACGAGCCTCG TGAGCCAGGC GCGGAGCGGC GATCCCTGA
|
Protein sequence | MLPILNYGPI LPSAEHRIEG SDMNLDNSGP PQPARARREE RPLTWDELRD RDPCIEASEP VPETLRHLWS DLSVARAVPD QTTRKGMGSP RERIAPHRPA WVSSAVRPVQ APRRLPPRLV HRGIPVDPLV VWGEDDRRSY DDTRYPWGCV CKILSSGKTG SGVLVGPRHV LTASHVVNWG GTAETIEVHR AGATAAATAR TVRRWTFTKI TGDPGASTVD EDYAVLVVDQ RLGDRFGWMG VRTYDSAWDE EDWWWNIGYP DDVSAGLFPI YQRNKKLDED AWDYGSGRAM TTAADLMPGQ SGGPMFGFWS DGPFVVAVVS AVGNVFLTGT ENYCSGGSDL TSLVSQARSG DP
|
| |