Gene Information |
Locus tag | M446_3851 |
Symbol | |
ID | 6132071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 4295569 |
End bp | 4296669 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641644016 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_001770658 |
Protein GI | 170742003 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
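The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the Gene Length of a CDS includes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(4295569, 4296669)
    print(length)                     # 1101
    print(protein_length_aa(length))  # 366
```

Under these assumptions the values agree with the Gene Length (1101 bp) and Protein Length (366 aa) fields listed in the record.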
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGACC GCGAACAGAT CCAGACGAAC CTGCCTGACC TTGCTGTGTT CGTGACGCTG GAACTCAGCA AGTCGGCCAG GTTCCTGGCC GCGCAGGCCA TCCCGAGTGG GAAGACCTCG GCACACCGGC TCAGCGGCGG AGATGTGGAG GGTCTGCTCG CTCTGCTGCG CCGCCTGCAA GCCCGCGAAC AGCGCAGCTC TGGCCGGGAG TTGGCGGTCA TCCTCGGCTA CGAGGCTGGT TACGACGGCT TCTGGCTGCA GCGCCGCCTC GCGGCCGAGG CGATCACCTG CTTCGTCATA GATCCCGGCA GTCTCCAGGT GGACCGCCGT GCCCGACGGG CCAAGACCGA CCGACTCGAC GCGGCCATGC TGCTCCGAGC TCTCATGGCG TGGTGCCGCG GCGATTACGC CGCCTGCCGC ATGGTTCAGG TGCCCTCGGT CGAACGCGAG GATGCCCGGC GCACCCATCG CGAGCGGCAG CGCCTGATCG CCGAGCGGGT TCAGCACGTC AGCCGGATCA AGGGCCTGCT CGCGACACAA GGCGTGTACA CCTTCCAACC GCTGCGCCGG GATCGTTCGG AGCGGCTGGC AGAGGTCCGC ACCGGCGATG GGCGCGAGCT GCCTCAGCGC CTGCGCGGTG AGGTCGAACG TGAGTTCCGG CGGCTGGAAC TCGTGCTCGA GCAGATCGCG GCAGCCGAGG CTGAGCGGGA CGCCGCCGCG GCCAATCCCG CGATCGAGGA CGCCGACGCC GAGAAGGTGG TGCGTCTGGC CCGTCTCGGC GGCATCGGCA CCGAGTTAGC CACGGTGCTG GTGCGCGAGG CGCTGTACGG GCCATTCGAC AACTGCAAGC AGGTGGCCGC CTACGCTGGC CTCACGCCAA GCCCATACGC CAGTGGCGAC CGTCAGCGTG ACCAGGGCAT CTCGGAGGCC GGCAACCCGC TGCTCCGCAA GTCGATGATC GAGTTGGCTT GGCTGTGGCT GCGCTATCAG CCGGGTAGCG GGCTGGCCCG CTGGTTCGTC GAGCGGGTCG GCACGGGACG CGGGCGCATC CGCAAGATCA CGGCGGTCGC CCTGGCGCGC AAGCTGCTGT TCGCCCTATA G
|
Protein sequence | MSDREQIQTN LPDLAVFVTL ELSKSARFLA AQAIPSGKTS AHRLSGGDVE GLLALLRRLQ AREQRSSGRE LAVILGYEAG YDGFWLQRRL AAEAITCFVI DPGSLQVDRR ARRAKTDRLD AAMLLRALMA WCRGDYAACR MVQVPSVERE DARRTHRERQ RLIAERVQHV SRIKGLLATQ GVYTFQPLRR DRSERLAEVR TGDGRELPQR LRGEVEREFR RLELVLEQIA AAEAERDAAA ANPAIEDADA EKVVRLARLG GIGTELATVL VREALYGPFD NCKQVAAYAG LTPSPYASGD RQRDQGISEA GNPLLRKSMI ELAWLWLRYQ PGSGLARWFV ERVGTGRGRI RKITAVALAR KLLFAL
|
| |
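The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, along with a GC fraction and a stop-codon check. The helper names are hypothetical; the stop codon set is the one used by translation table 11 (TAA, TAG, TGA).

```python
# Stop codons under NCBI translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for 10-character display blocks and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the final codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # First two display blocks of the gene sequence, used here only as a short excerpt.
    excerpt = clean_sequence("ATGTCTGACC GCGAACAGAT")
    print(len(excerpt))         # 20
    print(gc_fraction(excerpt))
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` and `ends_with_stop` gives a way to compare the sequence against the GC content and Gene Length fields of the record.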