Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3475 |
Symbol | |
ID | 4786293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3686344 |
End bp | 3687372 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640092055 |
Product | hypothetical protein |
Protein accession | YP_001022663 |
Protein GI | 124268659 |
COG category | [R] General function prediction only |
COG ID | [COG2144] Selenophosphate synthetase-related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.451422 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGATG CAATGCTGGG ACAGGGGGGG CTCGACGGGC TCGCCTCGGC GTTGCTGCGC GGGCGCGGCT TCGCACACAA GCGCGACATC AGCGACGTGG TGTCGGCGCT GTCGGCCGCG CTGCCGGGCG GCACCGCCGC GCTGGGCCAG GCGGTGGGCG TCGGCGACGA CTGTGCGGCG ATCCCCGACG GCGACGGCGG CTACCTGCTG TTCGCGATCG AGGGTTTCGT CGACGACTTC GTGCAGCGCA TGCCCTGGTT CGCCGGCTAT TGCGGCGTGA TGGTCAACGT GAGCGACATC TGCGCGATGG GCGGGCGGCC GATCGCCGTG GTCGACGCGC TGTGGAGCCG CGGCATGGCG CCGGGCCAGC AGGTGCTCGA GGGCCTGGCG GCCGCCTCGC AGCGCTACGG CGTGCCGATC GTCGGCGGCC ACAGCAACAA CCAGGCGGTC GGCGGCCAGC TCGCGGTGGC GATCCTCGGC CGAGCGAAGA CGCTGCTGAC GAGCTTCAAC GCCCGCCCCG GCGACACGCT GGTGATGGCG ATCGACCTGC GCGGCGCCTA CCAGGAGCCC AACCCCTACT GGGACGCGTC GACCCGCGCG CCGGCCGAGC GCCTGCGCGC CGACCTCGAG CTGCTGCCGG CGCTGGCCGA GAGCGGCCTG TGCGATGCGG CCAAGGACAT CAGCATGGCC GGCGCCGTGG GCACGGCGCT GATGCTGCTG GAGTGCTCGC AGGTCGGCGG CGTGATCGAC GTGCAGGCGA TCCCGCGCCC GCCCGGCGTG CCGCTGCTGC GCTGGCTGCA GTCCTTTCCG AGCTACGGCT ACGTGTTCAG CGTGCGGCCG GCGCAGGCAG CCGCGGTGGC GCGGCATTTC GAGTCGCAGG GCATCGCCTG TGCCGCGGTC GGCGAGGTCA CGGCGACCCC GCAGCTGCAT CTGCGCGACG GCGAGACGAG CGCGCTGCTG TGGGACCTGG TGGCGCAACC CTTCATCGGC GCGCGAGCCG TCGTGCCGCG GGAGCCGGCC CATGTCTAG
|
Protein sequence | MEDAMLGQGG LDGLASALLR GRGFAHKRDI SDVVSALSAA LPGGTAALGQ AVGVGDDCAA IPDGDGGYLL FAIEGFVDDF VQRMPWFAGY CGVMVNVSDI CAMGGRPIAV VDALWSRGMA PGQQVLEGLA AASQRYGVPI VGGHSNNQAV GGQLAVAILG RAKTLLTSFN ARPGDTLVMA IDLRGAYQEP NPYWDASTRA PAERLRADLE LLPALAESGL CDAAKDISMA GAVGTALMLL ECSQVGGVID VQAIPRPPGV PLLRWLQSFP SYGYVFSVRP AQAAAVARHF ESQGIACAAV GEVTATPQLH LRDGETSALL WDLVAQPFIG ARAVVPREPA HV
|
| |