Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3812 |
Symbol | |
ID | 5835085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4232899 |
End bp | 4234854 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641369604 |
Product | hypothetical protein |
Protein accession | YP_001641257 |
Protein GI | 163853214 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATCG ACGAAGCCGC CCCGCGCCCA CCGGCCCAGG TGCCGCCCCC CTTGGTGCCC GGTTTGCGCG GGCTCCTGAC CCTCGCCGTC GGCGTCGTGC TCGTGACAGC CCTCTACTTC GGGCGTGAGG TGTTCATCCC GCTTGTGCTG GCGGTTCTCC TCAGCTTCGT CCTGGCTCCT GTCGTCAATC TGCTGCGCCG ACTGAGGCTC GGCCGAGTGC CATCGGTCAT CGTCGCCGTA CTGCTCGCGC TCACGGTCAT CGGCGGCATC GGAGCCGTTA TCGGTACTCA GGTCGCGGGG CTCGCCGGCG ACCTGCCGCA ATACCAAGCC ACCGTGCAGA AGAAGTTCGC CGGGCTGCAG GCAGGTTGGC TCGGCAATGC CAGCCGGATC TTCCAGCGCT TCAACCACCA AGTCCACAAC GCCACCCAGA AGGCGGATGC CGCCGGCACT GCAGCGTCGA CGGGTCCCGC GGGCGACACC CCGAAGGCTC AACTCGTCAA GGTTCAGGAG CCCGATCCGT CACCCTTTAC CCTCGCCGAG AAGGTGCTCG GTCCCATCGT CGAGCCGTTG ACCACGGTCG GCATCGTCCT CGTCGTGGTC GTGTTCCTGT TGCTCCAGCG AGAAGACCTG CGCAACCGAA TGATCCGCCT GTTCGGATCG GGTGACCTGC ACCGGACGAC GATGGCCATG GACGACGCGG CCGGCCGGCT TGGCACATAC TTCCTGGCCC AGCTCGGCAT GAATGCCACC TTCGGCTTCA TCATCGGCCT CGGTCTCTGG TTCATCGGCG TGCCCAATCC GCTCCTGTGG GGCGTGTTCT CCGCGCTGAT GCGCTTCGTG CCCTATATCG GCGCCTTCCT CTCGGCCCTG TTCCCCCTTG CTTTGGCCGC CGCAGTCGAC CCAGGCTGGT CCATGGTCAT CGCAACGGCC ATCCTGTTCC TGGTGGTCGA GCCCCTGTTC GGGCACGTGA TCGAGCCGCT GCTCTACGGG CACTCGACCG GCCTCTCTCC CTTCGCGGTT ATTGTCTCGA CCCTGTTCTG GGGGTTCCTG TGGGGGCCGA TCGGCCTGAT CCTGGCCACG CCCTTCACCG TGTGCCTCGT GGTGCTCGGG CGGCACGTCG ACAGCCTGGA GTTCCTCGAC ATCATTCTCG GCGACCGGCC GCCGCTGACG CCGGTGGAGA ACTTCTATCA GCGCATGTTG GCCGGAGACC CGGACGAGGC ACGCGACCTC GGCGAGGCCA TGCTGAAGGA GCGCTCGCTG TCCTCCTACT ATGACGAGGT GGCACTGAAA GGGCTTCAGC TCGCCGCCAA CGACTACGCT CGCGGGGTCG TGACGCCGGC GCAGCTGGAG AACATCCGCG CTTCGGCACG CTCGCTCGTC GAGGATTTCG AGGATCGACC CGACGTCGAG CCGTCAGGCG AGGACAAAGC GGTCAACCCG AATGAGACGC CGACGCTGGC CGAGCGGACG CACTTCAAGA GCGAGGCCGT CCCTGGGCAG GCACCGCCTC GCGAGATGCT ACCCGAGGCG TGGCGGGGCG AAACACCGGT GCTGTGCGTC GCTGGGCGCG GCCCCCTCGA CGAAGCCTCC TCGGCCATGC TGGCCCAGCT GCTGCGCAAG CATGGACTTG AGGCCCGCGT GGTTCCGTAC GGTGACGTCT CACGCGAGCA GATCCGCAAC CTAGATCTGA CGGGAGTGGC GATGGTGTGC ATCTCCTACC TCGACATCAC CGGCAGCCCG GCTCACCTGC GCTACCTTCT CGAGCGGCTC CGGAAGAAGG CGCCCACCGT GAAGCTGCTG GTCGGTTTGT GGCCCGAGGG CGAGAAGGTG CTGACGGATG CCTCCCTCGG TCGGCAGGTC GGAGCCGACG TCTACGTTTC CTCGCTCCGG CAGGCCGTTG AGGCCTGCCT CGAGGCTGCG ACAGACGACT CCGAAACTGT CGGTCGGGCG GCTTGA
|
Protein sequence | MLIDEAAPRP PAQVPPPLVP GLRGLLTLAV GVVLVTALYF GREVFIPLVL AVLLSFVLAP VVNLLRRLRL GRVPSVIVAV LLALTVIGGI GAVIGTQVAG LAGDLPQYQA TVQKKFAGLQ AGWLGNASRI FQRFNHQVHN ATQKADAAGT AASTGPAGDT PKAQLVKVQE PDPSPFTLAE KVLGPIVEPL TTVGIVLVVV VFLLLQREDL RNRMIRLFGS GDLHRTTMAM DDAAGRLGTY FLAQLGMNAT FGFIIGLGLW FIGVPNPLLW GVFSALMRFV PYIGAFLSAL FPLALAAAVD PGWSMVIATA ILFLVVEPLF GHVIEPLLYG HSTGLSPFAV IVSTLFWGFL WGPIGLILAT PFTVCLVVLG RHVDSLEFLD IILGDRPPLT PVENFYQRML AGDPDEARDL GEAMLKERSL SSYYDEVALK GLQLAANDYA RGVVTPAQLE NIRASARSLV EDFEDRPDVE PSGEDKAVNP NETPTLAERT HFKSEAVPGQ APPREMLPEA WRGETPVLCV AGRGPLDEAS SAMLAQLLRK HGLEARVVPY GDVSREQIRN LDLTGVAMVC ISYLDITGSP AHLRYLLERL RKKAPTVKLL VGLWPEGEKV LTDASLGRQV GADVYVSSLR QAVEACLEAA TDDSETVGRA A
|
| |