Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2100 |
Symbol | |
ID | 5833207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2353112 |
End bp | 2354374 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641367897 |
Product | hypothetical protein |
Protein accession | YP_001639566 |
Protein GI | 163851523 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.281286 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.00458205 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAATCA AAACAAAGCT TCTGGCCGCG ACCGCGGTGC TTTCCACCCT GAGCATCAAC GCGTTGCCAG TCCTCGCCGC AGATATGCCT GCCGCCAAGT CGGCGCCCGT CATCGTCGAG GAGCATTGTA AGGCTGCGAT CTCCACCCCG ACCTTCGGCG GTCTCATCAA GGCGAACCCG AACCCGGCCT GCATCGTGAC GGGACTGGGC GACATCTATG TCGGCGGCGC GGTCACCGGC TTCGCCTACA CCCAGACCAA CGCCTTCGGC ATCCTCTCGC CCAGCGCTGA GCAGGACCGC TTCGGCCGCG TCGACTTCTC GAACCTCCAG GGCTGGATCC AGAAGGCCGA CGGCCCGCTG CAATTCTACG TCCATGCCGG CCTGTACTCG ATCCCGGCGC TCGGCCTGCC GCTCTACTCC GCGTTCGAGC AGACCGAATC GCTGTTCGGC CCGATCCCGG TGGCCTTCGG CAAGTGGCAG ATCAACGACG AGTGGTCGAT CCAGGCCGGT CGGATGTTCA CCAACATCGG CTCCGAGCTG CTGTTCACCT ACCAGAACCT GAACATCTCC CGCGGTCTGC TGTTCAACCA GGAGAACTTC ATCAACCACG GCGTCCAGGT GAACTACGCC AACGGCCCGT TCGCGGCCGC TCTCGCGGTG ACCGACGGCT TCTACTCGGG TGAGCTGAAC TGGGTGACGG GCTTTGCCAC CTACAAGCTC AACGACGCGA ACACGATCGG CATCAACGGC GGCACGCATT TCAGCGATTT CGACGCCTCG ACCCGCAGCC CGCGCTTCCA GTTCGCGACG ATCAACTCGC TGCAGAACAG CAGCATCATC AGCGTGAACT ACACCTACGC CAACGGGCCG TGGATCATCT CGCCGTACTT CCAGTACACA AACGTCGCGC GCAAGGAGGG GTACTTCTCC CCGATCGAGG GCGCGGAGAC CTGGGGCGGC ACGCTGCTGG CCGGCTACAC CTTCACCGAC AACTTCGCGC TCGCCGGCCG CCTCGAATAC ATCGAGCAGT CGGGCACGCG GGGCGTGGTC ACCGGCCGCG GCGGCACCAG CGTCCTCTAC GGTCCGGGCA GCTCGGCCTT CTCGTTCACG ATCACCCCGA CCTTCACCTG GGATCGCTAC TTCCTCCGCG GCGAGTTCGC GACCGTCCAG GCCTACGACG TGACCCCCGG CTTCGGCTTC GGCCGCGACG GCACCAAGCG CTCGCAGGAG CGCTACCTCG TGGAGACCGG CTTCACCTTC TGA
|
Protein sequence | MTIKTKLLAA TAVLSTLSIN ALPVLAADMP AAKSAPVIVE EHCKAAISTP TFGGLIKANP NPACIVTGLG DIYVGGAVTG FAYTQTNAFG ILSPSAEQDR FGRVDFSNLQ GWIQKADGPL QFYVHAGLYS IPALGLPLYS AFEQTESLFG PIPVAFGKWQ INDEWSIQAG RMFTNIGSEL LFTYQNLNIS RGLLFNQENF INHGVQVNYA NGPFAAALAV TDGFYSGELN WVTGFATYKL NDANTIGING GTHFSDFDAS TRSPRFQFAT INSLQNSSII SVNYTYANGP WIISPYFQYT NVARKEGYFS PIEGAETWGG TLLAGYTFTD NFALAGRLEY IEQSGTRGVV TGRGGTSVLY GPGSSAFSFT ITPTFTWDRY FLRGEFATVQ AYDVTPGFGF GRDGTKRSQE RYLVETGFTF
|
| |