Gene Plut_2073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlut_2073 
Symbol 
ID3744202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium luteolum DSM 273 
KingdomBacteria 
Replicon accessionNC_007512 
Strand
Start bp2304204 
End bp2306612 
Gene Length2409 bp 
Protein Length802 aa 
Translation table11 
GC content59% 
IMG OID637770104 
ProductDNA topoisomerase I 
Protein accessionYP_375958 
Protein GI78187915 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCAA AAGCAGCGGC ACTGTCCGCC AGAAACAGGA CCCTGATCGT CGTAGAGTCG 
CCCTCCAAGG CTAAGACGAT CAACAAGTAC CTCGGGGACG GCTACACGGT GTTCGCCTCG
GTAGGCCATA TCAAGGATCT TCCCAAAAAG GAGATCGGGC TTGATTTCGA CAATCATTAC
GAGCCTCGCT ACGAGATCAT CCCCGGCAAG GAAAAGGTGG TGCGACAGTT GAAGAAGCTT
GCGTCGGAAG CCGACGGAGT CCTGATTGCA ACTGACCCTG ACCGCGAGGG CGAGGCTATC
GCGTGGCACA TCGCCAATGA AATCGGCAGC ACGCCGAAAC CGGTATTCAG GGTGATGTTC
AACGAAATCA CCAAAAGCGC CATCCTTGCA GCCATTGATG AGCCGCGCCA GATCGACTAC
CGGCTGGTCC GATCCCAGCA GACCCGTCAG GGGCTCGACA AAATCGTAGG CTACCGGGTC
AGCCCGTTTC TCTGGAATGT GGTCATGCGC GGCCTGTCCG CCGGGCGCGT GCAGTCCGTC
GCCCTGCGGC TGATCTGCGA GCGCGAGGTG GAGATCAACA ATTTTGAGGT TCAGGAGTAC
TGGACCATTG CGGCCGACTT CATCACAGAG GGCAAGGAAA CCTTCCGTGC ACGGCTGGTG
AAGCTCGAAG GAAAGAAACC CGAACTCTCG AACCAGGCAG AGGCGGAAGC CGCGGCAGAT
CTCGTACAGA AAGGCAGCTA TACGCTTCTG GACATCGCAC CGCGCACCCA GCAGCGCAAG
CCTCCGCTGC CCTTCACCAC GTCCCTCCTC CAGCAGGCTG CATCGAACCA GCTGGGATTC
GGCTCGAAAA AGACCATGCG CCTCGCACAG CAGCTCTATG AAGGCATTGA TCTCGGAGAA
GAAGGTGCGA CCGGCCTCAT CACCTACATG AGGACCGACT CTATCCGCAT CGGCAATGAG
GCGGCAGGAC AGGCCCGCAG CTTTATCGAA CACGCCTTCG GCAAGGAGTA CATAGGTTGG
GGCGGCGCGG CAAAATCAAG TAAAAATGCA CAGGACGCCC ATGAGGCAGT CCGTCCGACC
GCCGTCGGCA GGCGCCCGGA ACAGCTTAAG TCCTATCTCT CTGCGGACCA GTTCAAGCTC
TACGACCTCA TCTGGAAGAG GTTCGTGGCC GCCATGATGG CTCCGGCAAA AATCGAGCAG
ACAAGGGTTG ACGTCGGTGA TCCGGCCCGC GACATCATGT TCAGGGCAAG CGGCAGCAGG
GTGCTCTTCC CCGGCTTCAT GCGGGTGTTC AACGACCAGG AGGAGCTTGA GTACGAAGCC
CGGAAATCGA CGAAAGACGA AGGTGAGAAA GAGCAGGAGG TGCGCCTCCC GAAGCAGCTT
GAGAAGGACG GACTGCTCGG CCTCGGAGAA ATCGACAACC GCCAGAGCTT CACCCGTCCC
CCGGCGCGCT TCAGCGAAGC GACCCTCGTC AAGGAGCTCG ACAACTACGG CATCGGACGC
CCCTCGACCT ATGCATCCAT ATTCTCAACC CTTCAGGACC GCCGCTATGT GGAGCTGCAG
AAAAAGAAGA TCATCCCCAC CATGCTCGGC ATGGACGTGT CGCAGATCCT CGTTGCCAAC
TTCCCCGACC TCTTCAACGT AGACTTCACC GCCGAAATGG AGGGTGAGCT CGACAAGGTT
GCTGCCGGAG AGGACGAGTA CGAGAAAGTA CTCGACAGCT TCTATCGCCC GCTGGAAACT
GCGCTCAGCG TCCGGAAGGA AGATCCGCTG ATCCCCCAGA ACCGCGATGC GGCGCTCTGC
GAGAAATGCG GCGAAGGGCA CATGATCGTC AAGTGGACCC AGAGCGGCAA GTTCCTCGGC
TGTTCGCGCT ACCCGAAATG CCGCAACATC AAGCCGATCA GCACCAACCG CGAAAAGCCC
AAGGATACCG GCATCCTCTG CCCGTCATGC GGCGAAGGGC ACATGCTCCT GCGAAACGGC
CGGCTCGGCC CGTTCCTTGC ATGTTCCAGC TACCCGAAGT GCAACACCCT GCTCAACCTC
TCGAAACAGC GGCATGTCGA ACCGATGAAG ATTCCTCCGG TACAGACCGA TCTGGCCTGT
CCGAAGTGCG GCTCGCCGAT GAACCTCCGC GTCGGAAAGC GGGGGCCCTG GCTCGGATGC
TCGAAGTTCC CGAAATGCCG GGGACGGATG GCATGGAACT CCCTTGACGA AGGAGTCCGG
GCACACTGGG AGGCGGTCAT GGAAGAACAC AGGAAAGCCC ATCCCTCCGT CACCCTCATG
ATGCTCGACG GTACCCCTGC ACCGATGCAG ATGGCAGTCG ACGACATCAT ACAGATGGCA
GAAGAAAGCG GCATGGTCGA AATAGCCGTC GAAGCGCCTG AAGAGGCAGT GGCAGAAAAA
CAGGACTAA
 
Protein sequence
MASKAAALSA RNRTLIVVES PSKAKTINKY LGDGYTVFAS VGHIKDLPKK EIGLDFDNHY 
EPRYEIIPGK EKVVRQLKKL ASEADGVLIA TDPDREGEAI AWHIANEIGS TPKPVFRVMF
NEITKSAILA AIDEPRQIDY RLVRSQQTRQ GLDKIVGYRV SPFLWNVVMR GLSAGRVQSV
ALRLICEREV EINNFEVQEY WTIAADFITE GKETFRARLV KLEGKKPELS NQAEAEAAAD
LVQKGSYTLL DIAPRTQQRK PPLPFTTSLL QQAASNQLGF GSKKTMRLAQ QLYEGIDLGE
EGATGLITYM RTDSIRIGNE AAGQARSFIE HAFGKEYIGW GGAAKSSKNA QDAHEAVRPT
AVGRRPEQLK SYLSADQFKL YDLIWKRFVA AMMAPAKIEQ TRVDVGDPAR DIMFRASGSR
VLFPGFMRVF NDQEELEYEA RKSTKDEGEK EQEVRLPKQL EKDGLLGLGE IDNRQSFTRP
PARFSEATLV KELDNYGIGR PSTYASIFST LQDRRYVELQ KKKIIPTMLG MDVSQILVAN
FPDLFNVDFT AEMEGELDKV AAGEDEYEKV LDSFYRPLET ALSVRKEDPL IPQNRDAALC
EKCGEGHMIV KWTQSGKFLG CSRYPKCRNI KPISTNREKP KDTGILCPSC GEGHMLLRNG
RLGPFLACSS YPKCNTLLNL SKQRHVEPMK IPPVQTDLAC PKCGSPMNLR VGKRGPWLGC
SKFPKCRGRM AWNSLDEGVR AHWEAVMEEH RKAHPSVTLM MLDGTPAPMQ MAVDDIIQMA
EESGMVEIAV EAPEEAVAEK QD