Gene M446_4005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4005 
Symbol 
ID6132980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4459612 
End bp4460610 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content68% 
IMG OID641644162 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001770802 
Protein GI170742147 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGATGC TCGCGATCGA TCTGGCCAAG CAGTCGTTTC ACGTTCACGG CGTCGATGCC 
GATGGTCAGG TGATCTCCCG GCGCGTCGGG CGCACCAAAC TTCCGGCGCT GGTGGCCAGC
CTCGCCCCGA AGGTGATCGC CATGGAGGCT TGCGCCACGG CCCATCATTG GGCGCGAGCT
TTCCTCGCGG CCGGGCATGA GGTTCGGCTG ATCAACCCGC GCTTCGTCAA GCCGTTCGTG
CGCGGCTCGA AGAACGATGC CGTCGACGCC GAGGCGATCT TCGACGCCGC CTCACGTCCC
ACGATGCGGT TTGTGCCTGT GAAGTCGACC GAGCAGCAAG ACCTGCAGTC GCTTCATCGC
GTCCGCGATC GGCTGGTCTC GCAACGCACG AACCTGATCA ATCATACCCG TGGGCTCCTG
GCTGAGTACG GCCTCATCTA CCCGAAGGGT GCGGCCCGCT TTCCAGCGCG TGTGCGGGCG
GAACTTTCCG AGGCGGGACT GTCGCCGATG GCGCGAGCCA CCTTCGCGGC CCTGCTCGAC
GAGTTGGAGA CCCTGGAGAC GCGGCTTGAG CGGCTCGACG ATCAACTTCG GGCGATCTGC
CGCGAAGACG TCGTCTGCCG CCGCCTGATG ACGTTGCCTG GCGTGGGCCC GGTCGTCGCC
ACCGCCCTCA AGGCCAGCGT CGGCGATGCC CGCCAGTTCC GCTCAGGGCG CGAACTCGCG
GCCTGGATCG GCTTGGTGCC GCGACAGTAC TCCACCGGCG GCAAGCCGCA CCTCGGGGGC
GTCGGACGCC GGGCCAACCA CTATCTGCGG CGCCAACTCG TGCACGGCGC CCGCGCGGTC
GCCTTGCGCC TGGCCACGAA GACCGATCCG CGCTCACGCT GGTTCCAGGC GGTGATCGAC
CGGCGCGGGT TCAACAAGGG GATCGTGGCG ATGGCCAACA AGACCGCGCG GATAGCCTGG
GCGATGCTGA GGCGCGAGGA GGATTACGCC CGCGCCTGA
 
Protein sequence
MQMLAIDLAK QSFHVHGVDA DGQVISRRVG RTKLPALVAS LAPKVIAMEA CATAHHWARA 
FLAAGHEVRL INPRFVKPFV RGSKNDAVDA EAIFDAASRP TMRFVPVKST EQQDLQSLHR
VRDRLVSQRT NLINHTRGLL AEYGLIYPKG AARFPARVRA ELSEAGLSPM ARATFAALLD
ELETLETRLE RLDDQLRAIC REDVVCRRLM TLPGVGPVVA TALKASVGDA RQFRSGRELA
AWIGLVPRQY STGGKPHLGG VGRRANHYLR RQLVHGARAV ALRLATKTDP RSRWFQAVID
RRGFNKGIVA MANKTARIAW AMLRREEDYA RA