Gene Information |
Locus tag | Mext_4847 |
Symbol | |
ID | 5835331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 5415365 |
End bp | 5416426 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641370644 |
Product | GSCFA domain-containing protein |
Protein accession | YP_001642286 |
Protein GI | 163854243 |
COG category | |
COG ID | |
TIGRFAM ID | |
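The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that the listed gene length is consistent with) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length(5415365, 5416426)
print(length)                  # 1062
print(protein_length(length))  # 353
```

Both results agree with the Gene Length (1062 bp) and Protein Length (353 aa) rows.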
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCCTT ACCGCGATCT ACCGGCCGAG CGCTTCTGGC GCAAGGCGGT GGCGGGTGTG CCGCCCTTCG CCCTCGATCC GGGTCCGCGC GAGACCTTTC GCATCGCCCG CACCGACCGC GTCGCGACCG CCGGCTCCTG CTTCGCGCAG CGCGTCTCGC AGGCGCTGGC CCGCGGCGGA TTCCGATACC ACGTCACGGA AACCGCGCCC GATGGGTTGA GCGAGGCCGA GGCGACCGCG CGTCAGTTCG GCACCTTCTC GGCGCGTTAC GGCAATCTCT ACGGCCCGCG CCAGTTCGTG CAGCTCTTCG ACCGGGCCTT CGGTGCCTTC GATCCGCAGC TCAAGGCGTG GCAGCGGGAG GACGGACGCC TCGTCGATCC GTTCCGCCCG ACCATCGAAC CTGAAGGCTT TTCCGACGAG GCGGCGGTGA TCGAAGCCCG CGAAACCCAT CTCGGCCGGG TGCGCGACCT TTTCGAGAGC CTCGACGTGC TGGTCGTGAC GCTCGGCCTC ACCGAGGGCT GGCGTTGTCG TGCCGACGGG GCGGCGCTCT CCCTCGCGCC GGGGGTCGCG GGCGGGCAAT TCGATCCGCA GGAGGTCGCC TTCGTCAATG CGGGCACGGC CGAGGTGATC GCCGACGTCA ATGGGTTCCT CGATCGGCTC TGGAGCGTGA ATGCGGCGGC ACGGGTCATC CTCACCGTCT CGCCGGTGCC GCTGATCGCG ACCTATCGCG ACCAGCATGT GCTCGTCGCC AACGCGCACT CGAAGGCGGT GCTGCGGGCG GCCGCCGGGG AGGTCTGCGA ACGCAGCGAT CCGCGCCTCG TCTACTTCCC GTCCTACGAG ATCATCACGA GCCACACCAA TGGCGGGCGC TACTACGAGG AAGACCAGCG CAGCATCGCC GAGGCCGGCG TCGCGCACGT GATGCGCGCC TTCATGGCGA GCTTCGCCCC CGGTTCGGCC GAGAAACCGG AGCCGCAGCC CTCCCACGAC AGCGAAGCGG AATTCGCCGG CACCGCAGGC GTCATCTGCG ACGAGGAGGC GATCGAGCGC AGCCTCGCCT GA
|
Protein sequence | MNPYRDLPAE RFWRKAVAGV PPFALDPGPR ETFRIARTDR VATAGSCFAQ RVSQALARGG FRYHVTETAP DGLSEAEATA RQFGTFSARY GNLYGPRQFV QLFDRAFGAF DPQLKAWQRE DGRLVDPFRP TIEPEGFSDE AAVIEARETH LGRVRDLFES LDVLVVTLGL TEGWRCRADG AALSLAPGVA GGQFDPQEVA FVNAGTAEVI ADVNGFLDRL WSVNAAARVI LTVSPVPLIA TYRDQHVLVA NAHSKAVLRA AAGEVCERSD PRLVYFPSYE IITSHTNGGR YYEEDQRSIA EAGVAHVMRA FMASFAPGSA EKPEPQPSHD SEAEFAGTAG VICDEEAIER SLA
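The relationship between the gene sequence and the protein sequence can be sketched as a plain codon-by-codon translation. This is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. Because this gene starts with ATG, the sketch does not handle alternative start codons. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
# Codon table in NCBI layout: bases ordered T, C, A, G; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; a single trailing stop codon is dropped."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAACCCTT ACCGCGATCT ACCGGCCGAG"))  # MNPYRDLPAE
```

Passing the full gene sequence to `translate` should yield the 353-residue protein sequence, and `gc_fraction` on the same input can be compared with the GC content row.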