Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4216 |
Symbol | |
ID | 5833284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4691401 |
End bp | 4692858 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641370007 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_001641656 |
Protein GI | 163853613 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0428969 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.525681 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCAAAACC GCGTCCCTCC CACGGCCCCT CGTCTTGTGT CGCTCGGGCC GGGCCGGCTA TGGGGCTTGG CGGCGGGATC ATGCGAGGCG TTTGCGGTGC GACAGGGCGA TGCGGAGGGC GATGATCCGC GCGTCGCAGC ACTGCTCGCG CGTCTCTCGG ACGACAAGGT GCCGTTCTCG ATCGCCGGGC GCGACGACTT GGCCGGGCTC ATCCTGTCGC GCGCCGCGCT GAACGGTCAC ATCGATCCGA CGGCGCCGCC GCGCTGGTGG ACGCAGGGCG CGGGCGGGCT CGATCTGGCC GGCGCCGACC TCGCCGACGC CCGGCTGGAG ATGACCGATT TTTCCGACGC CAACCTGCGT CGCGCCTCGC TCGCCGGCGC GCTTGCGCGC TCGGCCGGCT TCGCGAATGC CTGCCTGGAG GAAGCGGACT TTGCCGGCGC CGACCTCAGC GGCGCGCGCT TTACCGGAAT TGCCGGCGGG CAGGCCTCCT TCCGCGAGGC GATGCTGGAG GATGCCGACT TCTCCGGCGC CACCATGCGC TTTGCCCGGC TCGACAAGGC GCTCCTCGAC GGCGCCCGCT TCGAGGGCGC CGATCTGTGG GGCACCGACT TCACCGGGGC GGATGCCGAC GATTCCGTGT TCCGAAAAGC CCGGCTCGAC GAGGCCAACC TCTCCGACTG CAATCTCACC GGCGCGGATT TCGAGGGGGC GAGCCTGAAG AAGGCGCGGC TCGTCGGCTC GCGGCTGCGC GGCGCCAACT TCTCCGGAGC CCACCTCGAC GGGGCGGACC TGTCGGGGGC CGACTTCTCC CGCACCAGCC TCGTGCGGCT CGACCTCACG ACATGCAAGC TGCACCGCGC GCGCTTTGCC GGCGCATGGC TGGAAGGCGT GCGGCTTACC GTCGAGCAGA TCGGCGGGAT GGTCGGCGAA GAGGCGGCGG GCGAATACGA GGCGGCGCAG GCGAGCTATC TCGCGCTCGA GCGCAACCTT CAGAGCATCG GCAGCCCCGA GGGCGCGAGC TGGGCCTACA AGCGCGGGCG CCGCATGGGC CGCCGCCATG CCGGCGTGCG GGCCCGCGAG GCCTTTTTCG CCCGTGATGT GCGGGGAACG CTGAGCTCCG GTTACCGCTG GATCGCCGAC CGCTTCGTCG AGTGGCTGTG CGACTACGGC GAGAGCCTCT CGCGGATCGC CCGCGCCTTC CTCGTCGGGA TCTTCCTGTT CGCCGGGGCC TATGGGGCGA CGGGCGGGCT CTTCCACGAG GGTGAGAACG CGCCGACCTA CAACCCGCTC GATCTCGTGA GCTACAGCGC GCTCAACATG ATGACCGCCA ACCCACCCGA GATCGGGGTG AAGCCGCTGG GCCGCGTCAC CAACCTGCTG GTCGGGTTGC AGGGGGCGGC GGGGATCGTG CTGATGGGGT TGTTCGGCTT CGTCCTCGGC AACCGCCTCC GCCGCTGA
|
Protein sequence | MQNRVPPTAP RLVSLGPGRL WGLAAGSCEA FAVRQGDAEG DDPRVAALLA RLSDDKVPFS IAGRDDLAGL ILSRAALNGH IDPTAPPRWW TQGAGGLDLA GADLADARLE MTDFSDANLR RASLAGALAR SAGFANACLE EADFAGADLS GARFTGIAGG QASFREAMLE DADFSGATMR FARLDKALLD GARFEGADLW GTDFTGADAD DSVFRKARLD EANLSDCNLT GADFEGASLK KARLVGSRLR GANFSGAHLD GADLSGADFS RTSLVRLDLT TCKLHRARFA GAWLEGVRLT VEQIGGMVGE EAAGEYEAAQ ASYLALERNL QSIGSPEGAS WAYKRGRRMG RRHAGVRARE AFFARDVRGT LSSGYRWIAD RFVEWLCDYG ESLSRIARAF LVGIFLFAGA YGATGGLFHE GENAPTYNPL DLVSYSALNM MTANPPEIGV KPLGRVTNLL VGLQGAAGIV LMGLFGFVLG NRLRR
|
| |