Gene Information |
Locus tag | Mext_2217 |
Symbol | |
ID | 5832754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 2459604 |
End bp | 2460701 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641368016 |
Product | CBS domain-containing protein |
Protein accession | YP_001639683 |
Protein GI | 163851640 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
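The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 2459604
END_BP = 2460701


def gene_length(start_bp: int, end_bp: int) -> int:
    """Number of bases spanned by an inclusive coordinate pair."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, "bp,", protein_length(n), "aa")
```

With the values in this record, the sketch reproduces the listed gene length of 1098 bp and protein length of 365 aa.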
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.332679 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCTCGC TGGCGCTCGG CTGGGTCGGC GAGCCGGCCC TGGCCCACCT GATCGAACCG CTGCTCTACT GGCTGCCGGA GAAAGCCGCC GGCCTCAGCG CCCATACGCT GGCCGTGGCT TTGGCCTTTG CCGTCATCAC CACGCTGCAC ATTGTGCTGG GTGAGCTGGC GCCGAAAAGC CTCGCGCTCC AGCGCAGCGA GCGCACGGCG CTGGTGATCG TGCGGCCCCT GACGCTGTTC CTGTTGGTGT TCGGCCCTGC GATCCATCTC CTCAACAGCC TCGGCAACGG GGTGCTGCGG CTCCTCGGCC TCAGGCCGGG CGAGGGTGAG GGAAATCTTC CCTCCCCCAA GGAACTCAGC CTGCTCGTCA CCGCGAGCCA CGAGGCGGGC CTGCTGCACG AGGCGCAGGA GGATGCGGTG GCGCGCATCC TAGCCATCGG CGAGCGGCGC ATCCGCGAGA TCATGACCCC GCGCAACGAG GTGGACTGGG TCGATCTCGA CGATTCTCAG GAGGAGATCA TCGAAGCGGT GCGCACCTGC CGCCACGAGC AGCTCGTCGT GAGCCGGGCG CAGATCGACG ATGTCGTTGG CGTGCTGCGC AAGCAGGATC TGCTCGATCA GTTCCTCGAC GGCAAGCCGC TCGACGTGAA GGGGGCGACC CGCGAGCCCA TCGTCGTCCA CGAGGGGTTG GCGATCCTCA AGGTGCTGGA GATGTTCCAG ACCAAGCCGG TGCGCATGGC GATCGTCGTC GACGAGTACG GCAGCCTGGA GGGCATCGTC ACCCAGACCG ACCTGCTGGA GGCGATGGCC GGCGAGATCC CCGAGCCCGG CGAGGAGCGG ATGGTGGTCG AGCGCGAGGA CGGCTCGCTG CTCATCGACG GCATGATCTC GGCGACCGAC GCGTTCGACC GCCTCGGCTT TCCGGAGCGA CCGCGCTCGG ACGACTTCTC CACGCTCGCG GGCTTCTTCA TCGTCCAGCT CGGACGCATC CCGACGGAGG GCGATGCGAT CGAGACCCAG GGCTGGCGAA TCGAGGTCGT CGACATGGAC GGGCGGCGCA TCGACAAGGT GCTCGCCACG CGCCTGCCGG ACGCTTGA
|
Protein sequence | MSSLALGWVG EPALAHLIEP LLYWLPEKAA GLSAHTLAVA LAFAVITTLH IVLGELAPKS LALQRSERTA LVIVRPLTLF LLVFGPAIHL LNSLGNGVLR LLGLRPGEGE GNLPSPKELS LLVTASHEAG LLHEAQEDAV ARILAIGERR IREIMTPRNE VDWVDLDDSQ EEIIEAVRTC RHEQLVVSRA QIDDVVGVLR KQDLLDQFLD GKPLDVKGAT REPIVVHEGL AILKVLEMFQ TKPVRMAIVV DEYGSLEGIV TQTDLLEAMA GEIPEPGEER MVVEREDGSL LIDGMISATD AFDRLGFPER PRSDDFSTLA GFFIVQLGRI PTEGDAIETQ GWRIEVVDMD GRRIDKVLAT RLPDA
|
| |
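The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is one of the alternative initiation codons, and an initiation codon is read as methionine. The sketch below illustrates this under stated assumptions: the sequence is already given in coding orientation (it starts with an initiation codon and ends with TGA even though the record lists the minus strand), the codon-to-residue mapping of table 11 is the same as the standard code, and the set of initiation codons is the one listed for table 11 by NCBI as I understand it. `translate_cds` and the constant names are hypothetical, not part of any library.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons accepted under translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS (start codon through stop codon).

    Whitespace is ignored, so the space-grouped sequence from the
    record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not an initiation codon")
    if CODON_TABLE[codons[-1]] != "*":
        raise ValueError("CDS does not end with a stop codon")
    # The initiation codon is read as Met regardless of its usual meaning.
    return "M" + "".join(CODON_TABLE[c] for c in codons[1:-1])


if __name__ == "__main__":
    # First ten codons of the gene sequence above, plus its TGA stop codon.
    fragment = "TTGTCCTCGC TGGCGCTCGG CTGGGTCGGC TGA"
    print(translate_cds(fragment))
```

Passing the full gene sequence from the record to `translate_cds` gives a string that can be compared with the listed protein sequence once the spaces are stripped from the latter.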