Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3301 |
Symbol | |
ID | 5831444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 3660314 |
End bp | 3661273 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641369101 |
Product | glycosyl transferase family protein |
Protein accession | YP_001640759 |
Protein GI | 163852716 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.904037 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.657687 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGCGA CCCCCGTACC CTCGCCCTCT GCGTCGGTCC TCGACGTCGT CATCGTCAAC TGGAATGCCG GGGACCAGCT CCGGGCCTGC CTCGCGAGCC TCGCCGCGAG CGAGGGGGCG GAGCACCTAC GGGTCGTCGT CGTCGACAAC GCCTCCTCCG ACGGTTCGGC GGAGGGGCTG GATCAGCCCG GCCTCGCACT CACGGTGCTG CGCAACGCAG ACAATCGCGG CTTCGCTCGT GCCTGCAACC AGGGCGCGGC TTTGGGCTCG GCCGCGGCCA TCCTGTTTCT CAACCCCGAT ACGGGGGTGA GCCGGGACGG TATCGCCGCC GCCCGCGCAC GGCTCGACGC CGATCCCGGC ACCGGCATCG TCGGCGCCCG GCTGGTCGAT GACGCCGGGC AGACGCACCG CACCTGTGCC CGCCACCCGA CGGGCGCGCG CCTGATCGCA CACACCCTGT TCCTCGACCG GTTGCTGCCC GGCCGCGTCG CGCCGCACTT CCTGCTCGAT TGGGACCATG CTGAGACGCG GGCCGTCGAT GCGGTGATGG GCGCCTTCCT GATGATCCGG CGCCCGCTCT TCGCTCGGCT CGGCGGGTTC GACGAGCGCT TCTTCGTCTA CTGGGAGGAT GCCGACCTCT GTGCCCGTGC CGCCGCCGCC GGCTTTGCGG TGTGTCACGT CGCAGAGGCC GAGATCCGCC ACCGCGGCCA GGGCACCACC GAGGCGGTGA AGGACCGACG CCTGTTCTAC TTCCTGCGGG CGCAGACGCT CTACGCGCAC AAGCATCACG GCCGGGCGGT GTCCCTCGCG GTGCTGGCGG CCGCGCTGGC CGTGAACCTA CCCGTCCGCC TCGGCCGGGC GCTCGTGCGC GGTTCGGGCG GAGACGCAGG TGCGGTGATC CGCGCAGGGC TTATGCTGAT ACGGGCCGTG CCGCGCCTGT TGACCGGCAG CGGTCGATGA
|
Protein sequence | MTATPVPSPS ASVLDVVIVN WNAGDQLRAC LASLAASEGA EHLRVVVVDN ASSDGSAEGL DQPGLALTVL RNADNRGFAR ACNQGAALGS AAAILFLNPD TGVSRDGIAA ARARLDADPG TGIVGARLVD DAGQTHRTCA RHPTGARLIA HTLFLDRLLP GRVAPHFLLD WDHAETRAVD AVMGAFLMIR RPLFARLGGF DERFFVYWED ADLCARAAAA GFAVCHVAEA EIRHRGQGTT EAVKDRRLFY FLRAQTLYAH KHHGRAVSLA VLAAALAVNL PVRLGRALVR GSGGDAGAVI RAGLMLIRAV PRLLTGSGR
|
| |