Gene M446_5045 details


Gene Information

Locus tag: M446_5045
Symbol:
ID: 6135698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 5525986
End bp: 5527155
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 72%
IMG OID: 641645181
Product: putative DNA topoisomerase I
Protein accession: YP_001771806
Protein GI: 170743151
COG category: [L] Replication, recombination and repair
COG ID: [COG3569] Topoisomerase IB
TIGRFAM ID:
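
The start/end coordinates and the two length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5525986, 5527155)
print(length_bp)                  # 1170
print(protein_length(length_bp))  # 389
```

Both values agree with the Gene Length and Protein Length fields of this record.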


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.110319
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00920421
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCGTCGG ATCCCGAGAC CGCGCCGTCC GCGCCGGAGG GTCGCGCCGA CCCGCGCGAG 
GCCGCCCGCG AGATCGGGCT GCGCTACGTC AGCGACGAGG AGCCCGGCTA TCGCCGCAAG
CGCAACGGGC GCGGCTTCCG CTACATCGAC CCGGACGGCC GGCCGGTCCG CGACGAGGCG
GTGCTCAAGC GCATCAGGGC CCTGGCGATC CCGCCGGCCT ACACGGATGT CTGGATCTGC
CGGCATCCCA ACGGGCACAT CCAGGCGACG GGGCGGGACG ATCGCGGCCG CAAGCAGTAC
CGCTACCATC CGCAGTTCCG GGAGGCGCGG GACTCGACCA AGTTCGCCCA CATGATGGAC
TTCGCGCGGG CGCTGCCGGC CCTGCGGGCG CGGGTGCAGG AGGATATGGG CCGGCGGGGC
CTGCCGCGGG AGAAGGTGCT CGCCACGGTG GTCCACCTGC TGGAGACCAC GCTGATCCGG
GTCGGGAACG ACGATTACGC CCGCGCCAAC CGCTCCTTCG GGCTCACGAC CCTGCGCGAC
CCGCACGTGA ACGTCGAGGG CGCGGAGCTG AAATTCCGCT TCAAGGGCAA GAGCGGCAAG
GTCTGGCAGC TGGCCCTGCG CGACCGGCGC GTGGCCAAGA TCGTGAAGGC CTGCCAGGAC
CTGCCGGGCC AGGAGCTGTT CCAGTACCTC GACGAGGACG GGGTGCAGCG CGACGTGACC
TCGGCCGACG TCAACGCCTA CCTGCGGGAG ATCACCGGCC GGGACATCAC CGCCAAGGAT
TTCCGCACCT GGTCGGGCAC GGTGCTGGCG GCCCTGGCGC TGCGGGAATT CGAGACCTTC
GACAGCCAGG CGGCGGCCAA GCGCAACGTG CGCAGCGCCA TCGAGCGGGT GGCCGAGCGG
CTCGGCAACA CGCCGACGAT CTGCCGCAAG TGCTACATCC ACCCGGAGAT CCTCGGCTCC
TACCTCGAAG GGAGCTTCCT GCTGCGGGCG CGCGACGAGA TCGAGGCGGA GCTGCGGGAG
GACATCCACC GGCTGCGGCC GGAGGAGACC GCCGTGCTGG CTCTGCTCCA GGGGCGGCTG
GCGGCGGACG CGCCCGCCGA GGGGCCCGCG GCGCAGAGGA GTCGCAAGGG AGCGGGCAGG
ACCCGCGCGG CTGCCCGCCG GGCGGCCTGA
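
The gene sequence is printed in blocks of 10 bases, so it must be stripped of whitespace before any analysis. A minimal sketch, with hypothetical helper names, of cleaning such a block and computing its GC fraction (the record reports a GC content of 72% for the full gene):

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGCCGTCGG ATCCCGAGAC CGCGCCGTCC GCGCCGGAGG GTCGCGCCGA CCCGCGCGAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Pasting the whole gene sequence into `clean_sequence` gives the input needed to compare against the reported GC content.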
 
Protein sequence
MPSDPETAPS APEGRADPRE AAREIGLRYV SDEEPGYRRK RNGRGFRYID PDGRPVRDEA 
VLKRIRALAI PPAYTDVWIC RHPNGHIQAT GRDDRGRKQY RYHPQFREAR DSTKFAHMMD
FARALPALRA RVQEDMGRRG LPREKVLATV VHLLETTLIR VGNDDYARAN RSFGLTTLRD
PHVNVEGAEL KFRFKGKSGK VWQLALRDRR VAKIVKACQD LPGQELFQYL DEDGVQRDVT
SADVNAYLRE ITGRDITAKD FRTWSGTVLA ALALREFETF DSQAAAKRNV RSAIERVAER
LGNTPTICRK CYIHPEILGS YLEGSFLLRA RDEIEAELRE DIHRLRPEET AVLALLQGRL
AADAPAEGPA AQRSRKGAGR TRAAARRAA
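
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own method: it uses the compact 64-character amino-acid string for NCBI translation table 11 (whose amino-acid assignments match the standard code) and a hypothetical `translate` helper that stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG (NCBI table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # tolerate the 10-base block formatting
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases of the gene sequence in this record.
print(translate("ATGCCGTCGG ATCCCGAG"))  # MPSDPE
```

The output matches the first six residues of the protein sequence listed above.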