Gene Csal_2146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2146 
Symbol 
ID4026486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2417369 
End bp2419333 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content68% 
IMG OID637967351 
ProductDNA topoisomerase III 
Protein accessionYP_574196 
Protein GI92114268 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.232139 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTTGA TCATCGCCGA AAAACCCAGC CTGGCACGAG CCATCGCCGA TGCCCTGCCA 
ACGGCGCCGC AGCGCCATGA GGGCTACCTG GTGTGCGGGG AGACCTGCAT CAGCTGGTGC
GTGGGACATC TCCTCGAGCA GGCACCGCCC GACGCCTACG ACGCCCGCTA TCAGCAATGG
CGTCTGGATC ATCTGCCCAT CGTGCCCCGG CAATGGAAGC TGATCCCGCG TTCCAAGGCG
CGCGCGCAGC TGGCGGTGAT TCGCAAGCTG CTCAAGCGTG CCGACACCGT CGTGCACGCC
GGCGACCCCG ACCGCGAGGG GCAACTGCTG GTGCAACAGG TGATCGCGCA TCTGGGCTGG
CGCGGCTCGA TCTCGCGCCT GCTGGTCAGC GACCTCAATC GCCCCGCCGT GCGCCAGGCC
CTGCAGTGCA TCGAGGACAA TGCGCGTTAT CAATCGCTGT TCGAGGCCGC CGAGGCCCGT
TCGAGGGCCG ACTGGGTATA TGGCATCAAT CTGTCTCGCG CCTGGACCCT CACCGGTCGC
CAGGCCGGCT ATCAGGGCGT CCTGTCGGTC GGCCGCGTGC AAACCCCGGT GCTGGGGCTC
ATCGTGCGCC GCGACCTGGA CATCGAAGCC TTCACGCCAC GCCCTTTCTA CCCCCTGTGG
GCGGACCTGA AGGTCGAGCA CGGAACCCTG CGTGCCTGGT GGCAACCGCA AGCCCCCCAC
CCACTCGACG AGCAGGGCCG ACTGCTCGAG CGCGCCCCGG CCGAGGCCCT GGCGGCCCAT
CTGCCCGGCG CCGAGGGCCA CCTGAGCGAG TTGCAGAGCA AGCGACAGAC GCAATCGGCA
CCGTTGCCGT ATTCGCTTTC TGCCCTGCAG GTCGATGCCG CGCGCCGCTA CAAGTTGCCG
GCCAAGCGCG TGCTGGATAT CTGCCAGACA CTCTATGAAC GCCACCAGTT GATCACCTAT
CCGCGTTCGG ATTGCCGCTA CCTGCCCGAG GGGCATTTCG CCGCGGCGAA GAAGACGCTG
GACACGGCCT GCGCCCAGGA CGCCACCCTG CGCGGGTGGT ATCAAGGGGC GGATTTTTCG
CGCCGTTCGC GCGCCTGGAA CGACAAGCAG GTCGGCGCCC ACCACGCCCT GGCACCCACC
GGGCGGACGT TCGACGCCTC CCGCCTGACC GGCGACGAGG CCAATGTCTT TCGTCTGATT
GCGCGCAACG TGCTGGCGCA GTTCTATTCA CCGCTGGTGA CCCGCGACGT CAAGGCAGCG
TTCTCGATTC ACGGCGAGCA CTTTCGTGCC CAGGGGCGCG AGATTCTCGA GCCTGGCTGG
AAGCCGCTCT TCACGACGCG CGACGAGGCC CCGCCCTTGC CCGCGCTGCA CGAAGGCGAG
GCGGCACGGG TCGTCGACTG CGCGGTGGAG GATCGTCAGA CACGGCCGCC GGATCCCTTC
ACCGATGCCA GCCTGATCAA GGCCATGATG AACATCGCCC GCTACGTCGA CGACGCCACG
GTCAAGCGTA CCTTGCGGGA TACCGACGGA CTGGGGACCG AGGCCACGCG CGCGGGAATC
ATGGAAACCC TCGTCACGCG CGGTTATATC GTGCGTCAGC AAGGCGCGTT ACGTGCCACT
CGCACCGGAC GCGCGCTCAT CCAGGCCTTG CCGGATACCG CCACCCGTCC CGAGCGAACG
GCACTCTGGG AACAACGTCT TGCCGACATC GCCGAGGGCC GCGGCGATGC GGATGCCTTC
CTCGACGCGC TGGTGGGTGA TGTCCGCCAA CTGCTCGACC AGGCGGATGC CGCACGCATG
CGCCAGGCGC TGGCCACCAC CGGCGGTGCC GAGCCGAGCT CGCCCTCGCG GGGCAAGAAG
CGCGGCACCC GGCGCGGCAG CGGCGACGCC TCGTCCAGCC CCCGACGCCG CGCCGGGAAG
TCCACGGCCA GGCGTCGCAC CTCCTCCAAG AAAGGCCAGA CATGA
 
Protein sequence
MRLIIAEKPS LARAIADALP TAPQRHEGYL VCGETCISWC VGHLLEQAPP DAYDARYQQW 
RLDHLPIVPR QWKLIPRSKA RAQLAVIRKL LKRADTVVHA GDPDREGQLL VQQVIAHLGW
RGSISRLLVS DLNRPAVRQA LQCIEDNARY QSLFEAAEAR SRADWVYGIN LSRAWTLTGR
QAGYQGVLSV GRVQTPVLGL IVRRDLDIEA FTPRPFYPLW ADLKVEHGTL RAWWQPQAPH
PLDEQGRLLE RAPAEALAAH LPGAEGHLSE LQSKRQTQSA PLPYSLSALQ VDAARRYKLP
AKRVLDICQT LYERHQLITY PRSDCRYLPE GHFAAAKKTL DTACAQDATL RGWYQGADFS
RRSRAWNDKQ VGAHHALAPT GRTFDASRLT GDEANVFRLI ARNVLAQFYS PLVTRDVKAA
FSIHGEHFRA QGREILEPGW KPLFTTRDEA PPLPALHEGE AARVVDCAVE DRQTRPPDPF
TDASLIKAMM NIARYVDDAT VKRTLRDTDG LGTEATRAGI METLVTRGYI VRQQGALRAT
RTGRALIQAL PDTATRPERT ALWEQRLADI AEGRGDADAF LDALVGDVRQ LLDQADAARM
RQALATTGGA EPSSPSRGKK RGTRRGSGDA SSSPRRRAGK STARRRTSSK KGQT