Gene M446_4018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4018 
Symbol 
ID6129339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4477755 
End bp4479074 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content72% 
IMG OID641644175 
Producttransposase 
Protein accessionYP_001770815 
Protein GI170742160 
COG category[L] Replication, recombination and repair 
COG ID[COG5659] FOG: Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCCCC TGAACACTGT CCTGGCTGGC CAGCCGCGCT TCGGCGCCTA CGTCGGCACC 
CTCTCGGACG TCCTTGGCCA TGCGGACCGG ATCGCGCCGC TCAAGGCCTA CTGCACCGGT
CTCCTCCTGC CGGGCACGCG CAAGAGCATC GCGCCGATGG CGGCCCAGAT CGCCCCCGCC
CGCGTTCAGG CCACCCATCA GGTGCTGCAC CACTTCGTGG CTAATGGTGA GTGGTCCGAC
GCCGCGTTGC TGGCTCGGGT CCGCGCCGTG GTTCTGCCCG TCATCGAGAG CCGGGGCCCG
ATCCAGGCCT GGATCGTCGA GGATACCGGC TTTCCCAGGA AGGGCCGGCA CTCGGTCGGC
GTGGGCCGGC AATCCTGCGG CCAGGTCGGC AAACAGGATG ACTGCCAAGT TGCCGTGACG
CTCTCGCTGG CCAACGGCCA GGCCAGCCTG CCGATCGCCT ACCGCCTCTA CCTGCCGGAA
GCGTGGGCTC ACGATGCGCA GCGGCGCATG AAGGCCGGCG TGCCTCAGGA GATCGGCTTC
CAGACCAAGC CGGAGATCGC CCTGGATCAG ATCCGGGCCG CACGGGCCGA GCGTGTGCCG
CCCGGTCTCG TCCTGGCCGA CGCCGGATAC GGGAGCGATG CGGCGTTCCG CACGGCCCTG
ACGGCGTTGG GATTGCGCTA CAGCCTCGGC ATTGCCGCTT CGACCGGCCT GTGGCCGCCC
GGAACGGCGC CCTTGCCGCC CGAGCCCTGG AGCGGGCGTG GTCGGCCGCC GACCCGGATG
CGGCGCAGCC CCGATCACGA GCCGCTCTCG GCCGAGACGC TGGCACGCGC GCTGCCGGAC
GCGGCCTGGC AGGAGGTCAG GTGGCGTGCG GGCACGAACG GACTCTCGGC GTCGCGCTTC
GCGGCGGTGC GGGTGCGCCC GGCGCATCGC GACGAGCAGC GACGCGGGCC TGGTCCCGAG
GAGTGGGTCC TGATCGAGTG GCCGGACGGC GAGGATGCAC CGACGCAGTA CTGGATCTCG
ACCCTGCCGG CCGAGACGTC CCTGGCCGAG CTCGTCAGCC GGACGAAGCT GCGCTGGCGC
ATCGAGCGGG ACTACACGGA ACTGAAGCAG GAGATCGGGC TCGGTCACTA CGAGGGGCGG
GGCTGGCGCG GCTTTCATCA CCATGCCAGC TTGTGCATTG CCGCGTACGG GTTCCTGGTC
TGCGAACGGG GCCGGTTCTC CCCCGCGGGA CCCGAGCTCG CCAGGCTCCA CACGCCCGAC
CGACCCGCAG GTGACAGATC CCGCGGCACC CCCGACCCGG CCCGCGCGTC GTGGGCCTGA
 
Protein sequence
MDPLNTVLAG QPRFGAYVGT LSDVLGHADR IAPLKAYCTG LLLPGTRKSI APMAAQIAPA 
RVQATHQVLH HFVANGEWSD AALLARVRAV VLPVIESRGP IQAWIVEDTG FPRKGRHSVG
VGRQSCGQVG KQDDCQVAVT LSLANGQASL PIAYRLYLPE AWAHDAQRRM KAGVPQEIGF
QTKPEIALDQ IRAARAERVP PGLVLADAGY GSDAAFRTAL TALGLRYSLG IAASTGLWPP
GTAPLPPEPW SGRGRPPTRM RRSPDHEPLS AETLARALPD AAWQEVRWRA GTNGLSASRF
AAVRVRPAHR DEQRRGPGPE EWVLIEWPDG EDAPTQYWIS TLPAETSLAE LVSRTKLRWR
IERDYTELKQ EIGLGHYEGR GWRGFHHHAS LCIAAYGFLV CERGRFSPAG PELARLHTPD
RPAGDRSRGT PDPARASWA