Gene Information |
Locus tag | Mext_3606 |
Symbol | |
ID | 5831960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 3984648 |
End bp | 3986588 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641369399 |
Product | hypothetical protein |
Protein accession | YP_001641055 |
Protein GI | 163853012 |
COG category | [S] Function unknown |
COG ID | [COG5338] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
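
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3984648
end_bp = 3986588
gene_length_bp = 1941
protein_length_aa = 646

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # span matches the reported gene length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop match the protein length
```

Under these assumptions the three fields agree with each other: the span equals the reported gene length, and the codon count minus the stop codon equals the reported protein length.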
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGCTA CGCGGCGGCT GCTCACGGCG GGACAGGGCG AGCGGGCGCC CCGCAACGGC ACGTTAACCA TGCCCGGTGT AGCCCCGTTA ACCATGGAGC GCCGGCCGGA CACGGAACGC CGCCCCGTCC ACGGGCCGCC GGCCGTATCC AGGGAAAGAG CGCTGCCGGT GTCACGCGAG CGGCCGAACG GGGACAAGCG AGGGGGGCGC CGCGGGGCGT TCGCTCTGGC GCTGCTCCCC GCCCTGCTCG GGGGCCCCGC GCTGGCGCAG GAGACGCCCG ACCCGTCGCA GGCCGAGCCC GCCCCGAGCA CGGCGTCGAA CCCGCTCGCG CGCAGCCCCT CCGAGCCGGC GACCGCGGGC CGCCCGCGCC GCTCCGCCTT CGACGCGCCG ACGGCTTTGC GCGGCTCCGG CGCCTTCTCC GCCCCCGCCC CGCTCGGCAG CGGCACGACC GCGAATCCGG CCCCCGCCTC CGAGGAGAGC GAGGAGCCGA GTGTGGCCCG CCTGCCGCGC TTCCGCAGCG CGACGAGCCT GCCCGGCTCC GCCGCGACGC GGGGCACGCC GGCCCGGCCC TCGATCTTGC GCCTGCGCGC GGCGCCCCCG CGCCGGCTCG GCACGGCGAC CCGCCAGATC ACCCAGACGC GCACGCAGCA GACGATCACG GATCTGCGCC TCACCCCGGT GATCCAGACG CCCGTCTCCG GCGTGCCGCT GCCGGCGCCG ATCCTCGGGC TCGGCCTGCC GAACGCCGCC GGGCTGCTGC TCGGCACGGC GCTCCGCCGG CCGATCCCCG CCGACACCGC CTACGCGCCG CTCGGCATCC GGCTCGGCAC CTTCACGCTG CTGCCGGCCT TCACCCAGAG CGTCGGCTAC GATTCGAACC CGGACCAGAT CGGCGGCACC CGCCTGCGGC CCTCCCTGGC GCTGCGCAGC GAGGCGGAGC TGGCATTGCG CAGCGAGTGG TCGGCGAGCG AACTCACCGC CGAGATGCGC GGCAGCTACC TCGAATATCC GCAGAACCCC GAGGCGAGCC GCCCCAATGC GGTGGGCACC GCGCGGATGC GCATCGACGT CGACCGCGAC ACCCGCATCG ATCTGGAGAC CCGGTTCCTG CTCGACAGCC AGCGCATCGG CAGCCCGGAT CTCGGCGCGG GCGGGGGGGC GACGACCCGG CCGCTCTTCG CCACCTACGG CGCGACCGCG GGCGTGCAGG AAAACTTCAA CCGGCTGCAG CTCTCGCTGC GCGGCTCGAT CGACCGCTCG GTCTTCGAGG ATGCGCAACT CGGCAACGGC ACCACAATCA TCCAGAGCGA CCGCGACGCC AACCAGTACG GCCTGCGCCT GCGCGCCGGA TACGAGATCT CGCCCGCGAT CACGCCCTTC GTCGAGACCT TCCTCGACAC CCGGGTCTAC GACACGCCGG TGGACCAGTT CGGCCTGCGC CGCGATTCCG ACGGCGTCGC CTTCACCGCG GGCGCGGCGG TGGCGCTCAA CAGCACGCTA ACGGCGGAAA TCTCGGGCGG CCTGCAGCAC CGCTCCTACA TCGATCGCAC CCTGCAGGAC ATCAACGCGC CGGTCGTCAA CGCGGCACTC ATCTGGTCGG TCTCGCCGCT GACCACGGTG CGGTTCAACC AGCAGACCGG CGTGATCGAG ACCGCGGTGC CGGGCTCCAG CGGCGCCTTC ACCGACGCCG CCACGCTCGA AGTGCAGCAC GACCTCTTGC GCAACCTCTC GATCACGCTG GGCGGCGCTT ACCTCTCCAC CAACTACGAC GGCGTGCGCA TCCGCGAGCG GGGCTACTCC 
GCCACCGCCC GGCTCGACTA CCGCTTCAAC CGCTGGCTGG CTCTCCGCGG CAGCTACATC TACTCGACGC TGAACAGCAC CGTCCCGCTC TCGACCTACG AGGCGCACAC GGTGCTGCTC GGGGTGCGGG TGAACCCCTG A
|
Protein sequence | MGATRRLLTA GQGERAPRNG TLTMPGVAPL TMERRPDTER RPVHGPPAVS RERALPVSRE RPNGDKRGGR RGAFALALLP ALLGGPALAQ ETPDPSQAEP APSTASNPLA RSPSEPATAG RPRRSAFDAP TALRGSGAFS APAPLGSGTT ANPAPASEES EEPSVARLPR FRSATSLPGS AATRGTPARP SILRLRAAPP RRLGTATRQI TQTRTQQTIT DLRLTPVIQT PVSGVPLPAP ILGLGLPNAA GLLLGTALRR PIPADTAYAP LGIRLGTFTL LPAFTQSVGY DSNPDQIGGT RLRPSLALRS EAELALRSEW SASELTAEMR GSYLEYPQNP EASRPNAVGT ARMRIDVDRD TRIDLETRFL LDSQRIGSPD LGAGGGATTR PLFATYGATA GVQENFNRLQ LSLRGSIDRS VFEDAQLGNG TTIIQSDRDA NQYGLRLRAG YEISPAITPF VETFLDTRVY DTPVDQFGLR RDSDGVAFTA GAAVALNSTL TAEISGGLQH RSYIDRTLQD INAPVVNAAL IWSVSPLTTV RFNQQTGVIE TAVPGSSGAF TDAATLEVQH DLLRNLSITL GGAYLSTNYD GVRIRERGYS ATARLDYRFN RWLALRGSYI YSTLNSTVPL STYEAHTVLL GVRVNP
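
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last codons by hand. This is a minimal sketch, not the database's own pipeline: the `translate` helper and the `CODON_TABLE` name are hypothetical, and the table is built from the standard amino-acid assignments, which translation table 11 shares (table 11 differs from the standard table only in its permitted start codons).

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring spaces, up to the first stop."""
    seq = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break  # stop codon ends translation and is not part of the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above, as listed in the record.
first_block = "ATGGGGGCTA CGCGGCGGCT GCTCACGGCG"
last_block = "GTGAACCCCTGA"

print(translate(first_block))
print(translate(last_block))
```

The first call reproduces the opening ten residues of the listed protein (`MGATRRLLTA`), and the second reproduces its final three residues (`VNP`) before stopping at the terminal `TGA` codon.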
|