Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3207 |
Symbol | |
ID | 5831499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 3557647 |
End bp | 3558795 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641369007 |
Product | CBS domain-containing protein |
Protein accession | YP_001640665 |
Protein GI | 163852622 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.403327 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAACG ACCGAAGTCG CGGCGCGGCC CTGGCCGCGC CAGCCGCGGA CGCCGAACCG CCCCCACGCG AGGCGTGGTA CGATCGTCTG CTGAACGTCT TCCAAATGCG TCCGCGCGAC TCCCTGCGCA CCGACATCGA GGAGGCCTTG GCCGAGCCCG ACACGGGTGA GGACGCCTTC TCTCCCCTTG AGCGCGCCAT GCTCAAGAAC GTGCTCGGGC TGCACAAGGT GCGCGTCGAC GACGTGATGC TGCCCCGCGC CGACATCGTG GCGGTGGCCA GCGACACCAG CCTCGGCGAT CTCCTGAAGC TGTTCCGCAC CGCCGGCCAT TCGCGCCTGC CGGTCTACGG CGAAACCCTC GACGATCCCC GCGGCATGGT CCACATCCGC GACTTCGTGG AATACCTCGC CACCCAGGCG GAAGCCGCCC CGCGCCGGGC CGCGCCGCAG CCTGTGGTGG CCACCGGCGC CGAGGCCAAG CCCACGCCGC GCCCGCGCCG CACGGCCTCC GCCCGCGGCG CGCTGCGCAG CCTCGATCTC GGCAAGGTCG ATCTCACCGC AACCCTCGCC TCCACCCGCA TCCAGCGCCC GGTCCTGTTC GTGCCGCCCT CCATGCCGGC GATCGACCTG CTGGTGCGGA TGCAGGCCAC GCGCACCCAC ATGGCGCTGG TCATCGACGA GTATGGCGGC ACCGACGGGC TGATCTCGAT CGAGGATCTG ATCGAGATGG TCGTCGGCGA CATCGAGGAC GAGCACGACG TGGCGGAGGG CCAGCTCGTC AACCGCATGG AAGGCGAGAC GGAGGCCTAT ATCGCCGACG CCCGCGCCGG GCTCGCGGAA GTATCGGCGG CAACCGGCCT CGACCTCGCC GCCGCTTTCG GGGAACTCGC CGAGGAGATC GACACGATCG GCGGCCTGAT CGTGACGCTG GCCGGCCGGG TTCCGGCGCG CGGCGAGCGG ATCCCCGGTC CCGACGACAT CGAGTTCGAG GTGCTGGACG CCGATCCCCG GCGGGTGAAG CGGATCAAGC TCCAGCGCGC GCCGGCCAAG ATCGGCACCG TCGTGCCGCT CGCCCTACCG CCGCCCCGCC CGGCGGCGCC GCAGGCACCC GACACGGACG CAGCGCAGGC CGCCGAGGCC GGGCGCTGA
|
Protein sequence | MTNDRSRGAA LAAPAADAEP PPREAWYDRL LNVFQMRPRD SLRTDIEEAL AEPDTGEDAF SPLERAMLKN VLGLHKVRVD DVMLPRADIV AVASDTSLGD LLKLFRTAGH SRLPVYGETL DDPRGMVHIR DFVEYLATQA EAAPRRAAPQ PVVATGAEAK PTPRPRRTAS ARGALRSLDL GKVDLTATLA STRIQRPVLF VPPSMPAIDL LVRMQATRTH MALVIDEYGG TDGLISIEDL IEMVVGDIED EHDVAEGQLV NRMEGETEAY IADARAGLAE VSAATGLDLA AAFGELAEEI DTIGGLIVTL AGRVPARGER IPGPDDIEFE VLDADPRRVK RIKLQRAPAK IGTVVPLALP PPRPAAPQAP DTDAAQAAEA GR
|
| |