Gene Information |
Locus tag | Mext_4748 |
Symbol | |
ID | 5834552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 5302116 |
End bp | 5303216 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641370545 |
Product | hypothetical protein |
Protein accession | YP_001642187 |
Protein GI | 163854144 |
COG category | |
COG ID | |
TIGRFAM ID | |
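The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that `Start bp` and `End bp` are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length. Both assumptions fit the values in this record but are not stated by it, and the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Mext_4748).
length_bp = gene_length(5302116, 5303216)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the sketch reproduces the listed gene length (1101 bp) and protein length (366 aa).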
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00502344 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGGTGA TCGCTTGTGC CGCTTCGTGC GGCCCGGCAT CCTCACGCGT CCTTCCCCGC GACCTCCTGC TCGAGAATCC CAGCATCCGC ACCATGCCGA TGATCGGACC GCAGTTTCGC CGCGTAATCG GCGCCGCGCT CGCCGCGCTT GTCCTCGCTG GAGCGTGGCC GGCTGCGGCT GCGCCGGATG TCTGGGCCTG CCAGCGCTAC CGCACGGAAC TCGCCAACCT CAACGCCAGC GCCTCGACCG CGTCGGCGCT CCAGAGCGAG GTCGCGCGTC TCGAGTCCTA CTATCGCAGC CTCAACTGCG AGGGCGGCAA GTTCCTGTTC TTCGATACGC GGCCCCCGCA ATGCGGCGCC GTCGAAGCGC GGATCCGTTC GTTGAACGCC ACCTATGGCG GTGGGGACGG CGAGGTCGTA GCCGCCCGCC GTCGGCAACT GGTCTCGGCC GTGTCGAACG CCTGCACCGG CCTGATCCCC GGCGAGGGAC AAGAGGGCGT TTCAGGCGGC CAGACCGCCC GGGGCGGCCC GAAGGTCATC TGCGTGAAGA CCTGCGACGG GTCGTATTTC CCGATGGGCA ACCTGCCGGA CGGGCGCGGC GGGGCCGACG AGATGTGCCA GGCGCTCTGC CCCGGCACGG AAGCGGTCGC CTACTCGATG CCGCATGGCG ACGACGCCCT GAAATACGCC GCCACGCTCA AGGGCAGCCG CGCCTACACA GCGTTGCCGA CCGCCTTCAA GTTCCGCAAG AGCTTCACCG CCGACTGCTC GTGCAAGCAG GAGGGCCAGA CCTGGGCGCA GTCGCTCGTG AAGGCCGAGA GCATGCTGGT GCGGCACAAG GGCGACATCT TCGTCACCCC GATGACGGCC GAGAAGCTCT CCCGCGCGCC CAAGGTGCGC CTGACCCTGG TCGGCCGAGC CGACCGGACC GCGGCCGGCC TCGCAGCCGA CGCCGTCAAC CGCGACGGCG CCGTGGCCCC GGCCGCGAAG GACGCCGCCG AACAGGCCGA ACAGACAGGC TCGACCGGCG AGAGCCGCAG CGCCATTCGC GTCATCGTGC CGAGCCTGCT TCCGCCCCCG GCCCTGACAC CCATCCCCTG A
Protein sequence | MSVIACAASC GPASSRVLPR DLLLENPSIR TMPMIGPQFR RVIGAALAAL VLAGAWPAAA APDVWACQRY RTELANLNAS ASTASALQSE VARLESYYRS LNCEGGKFLF FDTRPPQCGA VEARIRSLNA TYGGGDGEVV AARRRQLVSA VSNACTGLIP GEGQEGVSGG QTARGGPKVI CVKTCDGSYF PMGNLPDGRG GADEMCQALC PGTEAVAYSM PHGDDALKYA ATLKGSRAYT ALPTAFKFRK SFTADCSCKQ EGQTWAQSLV KAESMLVRHK GDIFVTPMTA EKLSRAPKVR LTLVGRADRT AAGLAADAVN RDGAVAPAAK DAAEQAEQTG STGESRSAIR VIVPSLLPPP ALTPIP
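The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a block-formatted string might be joined into a contiguous sequence and its GC fraction computed is shown below. The helper names are hypothetical, and the demonstration uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence above, for illustration only.
fragment = "ATGTCGGTGA TCGCTTGTGC CGCTTCGTGC"
seq = clean_sequence(fragment)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the full gene sequence, the result can be compared with the `GC content` field in the Gene Information section.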