Gene Paes_2212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_2212 
Symbol 
ID6459713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2390853 
End bp2393249 
Gene Length2397 bp 
Protein Length798 aa 
Translation table11 
GC content52% 
IMG OID642726188 
ProductDNA topoisomerase I 
Protein accessionYP_002016861 
Protein GI194335001 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCAT CCTCAGCGGC TCCGTCTGCA AAGAACAAAA CGCTTATTGT TGTCGAATCT 
CCATCAAAAG CGAAAACAAT CAATAAATAT CTCGGTACGG GCTATAAGGT CTTTGCCTCG
GTCGGTCACA TCAAGGACCT TCCGAAAAAA GAGATCGGCC TGGACTTCGA CAACAACTAC
GAACCCCGCT ATGAAGTCAT CGCAGGGAAA GAAAAGGTTG TCCGTCAGCT TCGCAAGCTG
GCCGGGGAAG CAAACGACAT CATGATAGCC ACTGACCCTG ATCGTGAAGG CGAAGCGATA
GCGTGGCACA TTGCCAATGA AGTCGAAGGC GCCCGAAAAC CGGTGCAGCG GGTCCTGTTC
AATGAGATTA CGAAAAACGC CATTATCGAT GCGATAAGCC ATCCCCGCCA GATCGACTAC
CGCCTCGTTC GTTCCCAGCA GACCCGTCAG GGACTCGACA AGATCGTCGG CTACAAAATC
AGCCCGTTTC TGTGGAACGT TGTCTTCCGC GGGCTGTCGG CGGGAAGAGT TCAATCCGTC
GCCCTCAGGC TGATCTGCGA ACGCGAAGAT GAAATCGGAA AGTTCGAAAC CAGAGAATAC
TGGACACTGT TTGCCGATTT CACGACCGTA TCAGGGGAAA CATTCTCTGC CAAACTGGTC
AAAATCAACG GCAAAGATGT CGATATCACC AACGAGAACG ATGCGACCGC CGTTGCCGAC
ACCATTCTCT CGCGCATCTA CGGCGTGACG GATATCTCCT CAAGAATCCA GCAGCGTAAA
CCGCCCGTTG CATTTACCAC CTCACTGCTG CAGCAGGCGG CATCGAACCA GCTCGGATTC
GGTTCAAAGA AAACCATGCG TGCGGCACAG CAGCTTTATG AAGGTATCGA ACTCGGCCCC
GAAGGCGCTA CCGGTCTGAT CACCTACATG AGAACCGATT CGACAAGGGT CAGCGCTGAA
GCTGTCGGTC AGGCACAAAG TTATATCGAA CAGCAGTTCG GCCCGCAATA CGCCGGGCAG
GGCAGAGCGG GAAAAAACGA CAAAAAAACC CAGGATGCAC ACGAAGCTAT CCGCCCGACA
TCGATATTCC GCACTCCGGC GGCAATGAAA CCGTTTCTGA CGGCAGACCA GTTCAAACTC
TACGAACTGA TCTGGAAACG CTTTCTCGCA TCCAGAATGG CTCCTGCAAA AATCGAACTG
ACCAAGGTAG AGGTCTCTGA TCATGAAGGA GAATTTCTCT TTCGCGCCAA TGGCAGCAAG
GTGCTCTTTC CCGGCTTTCT CCAGGTCTAC AGCGATCAGA AGGAGCTCGA TTACGAAGCA
AAAACATCGA CCAAGGATGA CGAGGAAAAA GAGCAAACCG TTCAGCTTCC AAAAAAACTT
GATCTGAACG AAAAACTCGA CTTGCAAACA CTTGATAAAA AACAGAGCTT CACCCGACCG
CCCGCCCGCT ACACCGAAGC AAGCCTTGTC AAAGAGCTCG ACAACCACGG CATTGGACGA
CCCTCGACCT ATGCGGCAAT TTTCTCGACC CTTCAGGACC GGCGATATGT TGAATTGCTG
AAACGAAAAA TCATTCCTAC CGAACTCGGT CGGGATGTAT CATTGATCCT GGTTGCCAAT
TTCCCTGATC TGTTTAACGT GACCTTTACT GCGCAGATGG AGGATGAACT CGATAAAGTC
GCTGCCGGCC AGGACGACTA TGAAAAAGTG CTCGACAGCT TCTATAAACC GCTCGAAGCC
TCTCTCTCAG ACCGTAAAAA AGATCCGGTT CTTCCGCAGA ACGACAAAGC GGAACGGTGC
GACAAGTGCG GAACAGGAAA AATGGTGGTC AAATGGACCA GTAGCGGCAA GTTTCTCGGA
TGTTCTGAAT ATCCGAAATG CAAAAACATA AAACCGCTCA GCAATTCGAA ACCAAAACCA
AAAGGCACCG GCATCGCCTG TCCCGGATGC GAAGACGGAC ACATGGTGCT GCGCGACGGG
CGCTTCGGCC CGTTTCTTGC GTGCTCCAAC TACCCTAAAT GCAACACGCT GTTGAACCTC
GACAAGCAGC GACGCATCCA GCCGCCGAAA ACACCTCCAC TCGAAACCGA TCTTTCCTGT
CCGAAATGCG GTGCCCCCCT CTATTTACGA ACAGGAAAAC GCGGACTGTG GCTGGGGTGT
TCGAAATTCC CTAAATGCAG GGGTCGTCAG GCCTGGGGAC AGCTTGAGCC CGCCCTGCAG
CAACACTGGC AAAGTGTCAT GGATCAGCAT CTTGCCGCTC ATCCACAGGT GACGCTTCTG
ATGACCGACG GTTCACCTGT GAACATGCAG CTCACGGTCG ACGAGATCAT GGTATTTGCC
GAAGAAAAAG GTCTCATCCC CTTGATGGAG GAGCAGAAAA CCGGGGTCAG TTCATAA
 
Protein sequence
MASSSAAPSA KNKTLIVVES PSKAKTINKY LGTGYKVFAS VGHIKDLPKK EIGLDFDNNY 
EPRYEVIAGK EKVVRQLRKL AGEANDIMIA TDPDREGEAI AWHIANEVEG ARKPVQRVLF
NEITKNAIID AISHPRQIDY RLVRSQQTRQ GLDKIVGYKI SPFLWNVVFR GLSAGRVQSV
ALRLICERED EIGKFETREY WTLFADFTTV SGETFSAKLV KINGKDVDIT NENDATAVAD
TILSRIYGVT DISSRIQQRK PPVAFTTSLL QQAASNQLGF GSKKTMRAAQ QLYEGIELGP
EGATGLITYM RTDSTRVSAE AVGQAQSYIE QQFGPQYAGQ GRAGKNDKKT QDAHEAIRPT
SIFRTPAAMK PFLTADQFKL YELIWKRFLA SRMAPAKIEL TKVEVSDHEG EFLFRANGSK
VLFPGFLQVY SDQKELDYEA KTSTKDDEEK EQTVQLPKKL DLNEKLDLQT LDKKQSFTRP
PARYTEASLV KELDNHGIGR PSTYAAIFST LQDRRYVELL KRKIIPTELG RDVSLILVAN
FPDLFNVTFT AQMEDELDKV AAGQDDYEKV LDSFYKPLEA SLSDRKKDPV LPQNDKAERC
DKCGTGKMVV KWTSSGKFLG CSEYPKCKNI KPLSNSKPKP KGTGIACPGC EDGHMVLRDG
RFGPFLACSN YPKCNTLLNL DKQRRIQPPK TPPLETDLSC PKCGAPLYLR TGKRGLWLGC
SKFPKCRGRQ AWGQLEPALQ QHWQSVMDQH LAAHPQVTLL MTDGSPVNMQ LTVDEIMVFA
EEKGLIPLME EQKTGVSS