Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_3552 |
Symbol | |
ID | 6133959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 3962914 |
End bp | 3964161 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641643719 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001770367 |
Protein GI | 170741712 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0620959 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGCGC CGGCCGGGCC CGGCGCTCGC CCGGCGGCCG CGCGGGAGGC GCCGCCCCGC GCCGAGGGGC CCCTCGACCT CGCCGGGGCG GAGCTGCCGC CGGGGACCGA CCTCGTCGAG GCGGATCTCT CGGGCGCGAA CCTCGCGCGG GCGCGCCTCT CCGGAATCCT CGCCCGCTCC GCGCGCTTCG ACGGCGCCCG CATCGAGGAG GCGGATTTCA GCGGGGCGGA CCTCAGCGGC GCGCATTTCG CGACCGTGGC CGGCGGCCAG GCCCGCTTCT CCGGCGCCAT GATGGAGGAT GCGCGCTTCT GCGGGGCCAC CCTGCGCTTC GCGACCCTGG CGGGCGGGCT CCTCGACGGG GCCGACTTCA CCGGCGCCGA CCTCTGGGGG GCCGACTTCA CCGACGCGGA TGCCGACGAC ACGCTGTTTC GCCGCGCCCG GCTCGACGAG GCGAAGCTCG TCAACGTCAA CCTGACCGGG GCGGTGTTCG AGGGGGCGAG CCTCGCCAAG GCGTCCCTCG CCGGCTCGCG GCTGAGCCGG GCCAACTTCG TCGAGGCGAA GCTCGACGGG GCCGACCTGT CGGGGGCCGA CCTCTCGGAT GCCCGCCTCG TGCGGCTCGA CCTCACCTCC TGCCGGTTGC GGCACGCGCG CTTCGCGCAT GCGTGGCTGG AGGGCACGCG CCTGCGGGTC GACCAGCTCG GCGGCGCGGT CGGCGAGGAG GTGGCGGGGG ATTACGCGGC CGCGGTGGCG AGCTACCTCG TGGTGGAGCG CAACCAGCGC AGCATCGGCG ACCGGGAGGG GGCGAGCTGG GCCTTCAAGC GCGCCCGCCG CATGGGCCGC CACCACGCGG GAGCGCTGAC CCGCGCCGCG TGGCGGGAGG GGGCGTGGCG CGCGGGGCTT CGCCACGGCT CCGACTGGCT CTCGGACCGC TTCGTCGAGT GGCTGTGCGA TTACGGCGAG AGCCTGACGC GGATCGTGCG CGCCTTCGCG TTGGCGATCC TGGTCTTCGC GGCGCTCTAC GGGGTGACCG GGGGGCTGAT CCCGGAGGGC CGGGACGGCG TGCCGACCTA CAACCCGCTG GACCTGGTGA GCTACAGCGC CCTCAACATG ATGACGGCGA ACCAGCCGGA ACTCGGCATC AAGCCCGTGG GGCGCGTCAC CAACATCCTG GTCGGCAGCC AGGGGGCGCT CGGCATCATC CTGATGGGCC TGTTCGGCTT CGTCCTCGGC AACCGGCTGC AGCGCTAA
|
Protein sequence | MSAPAGPGAR PAAAREAPPR AEGPLDLAGA ELPPGTDLVE ADLSGANLAR ARLSGILARS ARFDGARIEE ADFSGADLSG AHFATVAGGQ ARFSGAMMED ARFCGATLRF ATLAGGLLDG ADFTGADLWG ADFTDADADD TLFRRARLDE AKLVNVNLTG AVFEGASLAK ASLAGSRLSR ANFVEAKLDG ADLSGADLSD ARLVRLDLTS CRLRHARFAH AWLEGTRLRV DQLGGAVGEE VAGDYAAAVA SYLVVERNQR SIGDREGASW AFKRARRMGR HHAGALTRAA WREGAWRAGL RHGSDWLSDR FVEWLCDYGE SLTRIVRAFA LAILVFAALY GVTGGLIPEG RDGVPTYNPL DLVSYSALNM MTANQPELGI KPVGRVTNIL VGSQGALGII LMGLFGFVLG NRLQR
|
| |