Gene M446_4172 details


Gene Information

Locus tag: M446_4172
Symbol:
ID: 6130896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4608817
End bp: 4610052
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 71%
IMG OID: 641644318
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_001770958
Protein GI: 170742303
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
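
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4608817
end_bp = 4610052

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)       # 1236
print(gene_length % 3)   # 0, so the CDS is a whole number of codons
print(protein_length)    # 411
```

Both results agree with the Gene Length (1236 bp) and Protein Length (411 aa) fields.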


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACGCG TGATCGGAAT GGACCTCCAC CGCACCTTCG CCGAGGTGGT GGTGTGGGAG 
GACGGCCGGC TCAGGCATGC CGGCCGGGTC GACATGACCC GCAGCGGACT GGAGGGCTTT
GCCCGCACCC TGCAGGCCAG CGATGAGGTG GTGATCGAGG CGACCGGCAA CGCGATGGCC
GTCTCGCATC TGCTAAAACC GCACGTGGCG CGGGTGGTGA TCGCCAACCC GCTGCAGGTG
AAGGCGATCG CCCATGCGCA CGTGAAGACC GACAAGATCG ACGCGGGCGT GCTCGCCAAC
CTCTATGCGG CGGGCTACCT GCCGGAGGTC TGGACGCCGG ACGCGGCCAC CGAGCGGCTG
CGCCGGCTCG TGGCGCGGCG CAATCAGGTC GTGCGCCATC GCACGTGCCT CAAGAACGAG
ACCCACGCGA TCCTGCACGC TCACCTCGTG CCGCCATGCC CGCACGCCGA CCTGTTCAGC
CGGGTCGGCC GAGCCTGGCT GGAACGGCAG GCGTTGCCCG ACGACGAGCA CGCGGCGGTG
CGGCGGCACC TGCGCGAGTT CGATGCTCTG GGCGAGGACC TTGCCGTCCT CGATCGAGCG
ATCGGCGAAG CGACGGTCGA CAGTCCCGTG GTGCGCCGCC TGCTCACCGT CACCGGCATC
AACGTGACGG TGGCCGCCGG GCTCGCCGCC GCGATCGGCG ACGTGCGCCG CTTCCCCAGC
CCGCAGAAGT TGGTGAGCTA CTTCGGGCTG AACCCGCGCG TGCGCCAATC CGGCCTCGGG
CTCGCCCAGC ACGGCCGCAT CAGCAAGGCG GGGCGAAGCC ATGCCCGAGC GATGCTGGTC
GAGGCGGCCT GGGCCGCCGC CAAGGCCCCA GGGCCGCTGC ACGCGTTCTT CGTGCGGGTG
CGCGCCCGGC GCGGTCACCA GATCGCGGCG GTGGCCACGG CGCGCAAGCT CGCCGTGCTC
TGCTGGCACC TGCTGACCAA GGAGGCGGAC TATCTCTGGG CTCGCCCTGC CCTCGTGGCC
ACCAAGGTGC GCGGTCTCGA ACTGCAGGCC GGTCTGCCGC AGAAGAAGGG CAACCGGCGC
GGTCCAGCCT ACGCCTACAA CGTCAAGGCG TTACGCGAGC AGGAGATGGA GATCGCCCGG
CGGGCGGAGA CGGCTTACGA GCAGGTCGTG GCGCACTGGA CGCCGCGTCC CTCAAAGGCG
GCGCGCGGAC GCCTCAAGCC GGCAGGGCTC GGATGA
 
Protein sequence
MRRVIGMDLH RTFAEVVVWE DGRLRHAGRV DMTRSGLEGF ARTLQASDEV VIEATGNAMA 
VSHLLKPHVA RVVIANPLQV KAIAHAHVKT DKIDAGVLAN LYAAGYLPEV WTPDAATERL
RRLVARRNQV VRHRTCLKNE THAILHAHLV PPCPHADLFS RVGRAWLERQ ALPDDEHAAV
RRHLREFDAL GEDLAVLDRA IGEATVDSPV VRRLLTVTGI NVTVAAGLAA AIGDVRRFPS
PQKLVSYFGL NPRVRQSGLG LAQHGRISKA GRSHARAMLV EAAWAAAKAP GPLHAFFVRV
RARRGHQIAA VATARKLAVL CWHLLTKEAD YLWARPALVA TKVRGLELQA GLPQKKGNRR
GPAYAYNVKA LREQEMEIAR RAETAYEQVV AHWTPRPSKA ARGRLKPAGL G