Gene TM1040_1740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1740 
Symbol 
ID4075804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1834669 
End bp1836678 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content60% 
IMG OID638007054 
ProductDNA topoisomerase IV subunit B 
Protein accessionYP_613735 
Protein GI99081581 
COG category[L] Replication, recombination and repair 
COG ID[COG0187] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit 
TIGRFAM ID[TIGR01055] DNA topoisomerase IV, B subunit, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00895559 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCATA ATGAGATCAA ACCACATCAA AGAATGAGCA GCGATCCGCA CATGGCAGAT 
GACCTTCTGA GCCCCTCTTC GTCTGACACC TATGACGCCT CCTCGATCGA GGTTCTCGAG
GGCCTCGAAC CGGTGCGTCA GCGCCCCGGC ATGTATATCG GTGGCACGGA TGAGCGCGCG
TTGCATCACA TGGTGGCCGA GATCCTCGAC AACTCCATGG ACGAGGCGGT CGCGGGTCAT
GCCAACCGCA TCGAGGTGGA GCTGCACGCG GATTATTCGC TGACCGTGCG CGACAACGGG
CGCGGCATTC CAATTGATCC GCACCCGAAG TTTCCCGACA AATCCGCGCT GGAGGTGATC
CTCTGTACGC TGCACGCAGG CGGCAAGTTC TCGGGCAAGG CCTACCAGAC ATCGGGCGGC
CTGCACGGGG TTGGCTCTTC GGTTGTAAAC GCGCTCTCGG ATTCGCTGGT GGTTCAGGTG
GCCAAGAACA AGGAATTGTT CGAACAGCGG TTTTCGCGCG GCAAGCCCTT GGGCGGTGTG
GAAAAAATCG GCGCTGCCCC CAATCGGCGC GGCACCACGG TGACATTTCA CGCTGATGAG
CAGATCTTTG GCTCGCACCG GTTCAAACCG GCGCGCCTGT TCAAGCTCGT GCGCTCCAAG
GCCTATCTTT TTTCGGGCGT CGAGATCCGT TGGAAATCCG CCATTGATGA CGGAGAGACC
CCACAGGAAG CGACATTCCA TTTCCCTGGT GGTCTGCGCG ATTACCTCAC CGAAGTGCTG
GGCAAATCCT CGGTTTATGC CGATGCCCCC TTTGCAGGCA AAGTCGAGTT TCGCGAGAAG
TTCGGCGAGC CGGGTTATGT GGAATGGGCG ATCAACTGGA CGCCCTCGCG CGACGGGTTC
ACCCAGTCTT ATTGTAACAC CGTCCCCACC CCGGAAGGCG GCACCCATGT CGCGGGGTTC
TGGTCCGCGA TCCTCAAGGG GATCAAGGCC TATGGCGAGC TGGTCGGCAA CAAGAAGGCA
GGCCAGATCA CCCGCGACGA CCTGATGGCG GGTGGCTGTG CGCTGGTCTC TTGTTTTATC
GCTGACCCGG CGTTTGTGGG CCAGACCAAG GACCGCCTGT CGACCGAAGC CGCGGCCAAG
ATGGTTGAAA ACTCGGTGCG CGACCATTTC GACAACTGGC TTGCGGCCGA TACAAAATCC
GCGGGCGCCA TCCTTGATTT CCTGATCCTG CGCGCCGAGG AACGTCTGCG CCGCAGACAG
GAAAAGGAAA CCGCCCGTAA ATCCGCCACC AAGAAACTGC GCCTGCCGGG CAAGCTCACG
GATTGTACAT CCAAGGATCG CTCGGGCACC GAACTGTTCA TCGTCGAGGG GGACTCGGCC
GGTGGCTCCG GCAAAGGCGC GCGCAACCGC GAGTATCAGG CGCTCCTGCC GCTCAAGGGC
AAGATCCTGA ACGTGCTCGG TGCGGCCTCG GGCAAGCTTA CCTCGAACGC CGAGATCCGA
GATCTCTGCG AAGCGCTGGG GGTGGGGCTA GGAACCAAGT TCAACCTTGA TGACCTGCGC
TATGACAAGA TCATCATCAT GACCGATGCG GATGTGGACG GTGCACATAT CGCGTCGCTT
CTGATGACCT TTTTCTTTAC CCAGATGCGG CCGCTGATTG ACGGTGGCCA TCTCTATCTC
GCCTGCCCGC CCCTGTTCCG CCTCACCCAG GGGGCCAAGC GCGTCTACTG CCTCGACGAG
GCCGAGCGCG ACATGTGGAT GGAAAAAGGT CTGGGCGGCA AAGGCAAGAT CGACGTGTCG
CGCTTCAAGG GTCTTGGTGA AATGGATGCC AAGGACCTGA AGGAAACCAC GATGGATCCC
AAGACCCGCA AGCTGATCCG CGTCACCATC GACGAGGACG AGCCCGGCGA GACTAGCGAC
CTTGTGGAAC GCCTGATGGG CAAAAAGCCC GAGCTGCGAT TCCAGTACAT CCAGGAGAAC
GCCAAGTTCG TCGAGGAACT GGACGTTTAA
 
Protein sequence
MRHNEIKPHQ RMSSDPHMAD DLLSPSSSDT YDASSIEVLE GLEPVRQRPG MYIGGTDERA 
LHHMVAEILD NSMDEAVAGH ANRIEVELHA DYSLTVRDNG RGIPIDPHPK FPDKSALEVI
LCTLHAGGKF SGKAYQTSGG LHGVGSSVVN ALSDSLVVQV AKNKELFEQR FSRGKPLGGV
EKIGAAPNRR GTTVTFHADE QIFGSHRFKP ARLFKLVRSK AYLFSGVEIR WKSAIDDGET
PQEATFHFPG GLRDYLTEVL GKSSVYADAP FAGKVEFREK FGEPGYVEWA INWTPSRDGF
TQSYCNTVPT PEGGTHVAGF WSAILKGIKA YGELVGNKKA GQITRDDLMA GGCALVSCFI
ADPAFVGQTK DRLSTEAAAK MVENSVRDHF DNWLAADTKS AGAILDFLIL RAEERLRRRQ
EKETARKSAT KKLRLPGKLT DCTSKDRSGT ELFIVEGDSA GGSGKGARNR EYQALLPLKG
KILNVLGAAS GKLTSNAEIR DLCEALGVGL GTKFNLDDLR YDKIIIMTDA DVDGAHIASL
LMTFFFTQMR PLIDGGHLYL ACPPLFRLTQ GAKRVYCLDE AERDMWMEKG LGGKGKIDVS
RFKGLGEMDA KDLKETTMDP KTRKLIRVTI DEDEPGETSD LVERLMGKKP ELRFQYIQEN
AKFVEELDV