Gene Cphamn1_2463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2463 
Symbol 
ID6376158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2626484 
End bp2628898 
Gene Length2415 bp 
Protein Length804 aa 
Translation table11 
GC content51% 
IMG OID642684940 
ProductDNA topoisomerase I 
Protein accessionYP_001960838 
Protein GI189501368 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCTT CTCCAGCAAG CTCGTCAGCA AAAGGAAAAA CACTGATTGT TGTCGAATCC 
CCGTCAAAGG CTAAAACCAT TAATAAATAT CTTGGTTCAG ATTACCTGGT GTTCGCCTCT
GTCGGCCATA TTAAAGACCT TCCCAAAAAG GAGATCGGAC TGGATTTCGA CAATAATTAT
GAACCCCGCT ATGAAGTTAT TTCCGGCAAG GAAAAGGTTG TCCGTCAGCT GAAAAAGCTT
GCCGGTCAAG CCAATGATAT TCTGATAGCC ACTGACCCTG ACCGTGAGGG AGAGGCTATC
GCGTGGCATA TTGCCAACGA GGTCGATGCT GCACAAAAAC CTGTCCATAG GGTGCTGTTT
AACGAGATCA CGAAAAAAGC CATCCTCGAA GCGATCAACG ACCCGCTCCA GATTGACTAC
CGTCTGGTTC GCTCTCAGCA GACCCGACAG GGCCTGGATA AAATCGTCGG GTACAAAATA
AGCCCTTTTC TCTGGAATGT GGTTATGCGG GGATTATCCG CCGGAAGGGT GCAATCCGTT
TCCCTGAGAC TCATCTGCGA ACGGGAGGCG GAAATCGATA AGTTTGAACC CAAAGAGTAC
TGGACTATCT TTGCCGATTT CACGACCGGA TCAGGAGAAA CCTTCTCGAC CAAACTGGTC
AGAATCGACA GTAACAAGGC GGAAATCACC AATCAGTCCG ACGCCGAGAA AACAGCGTCG
GACATTCTTT CACGAATCTA TGGCGTCGCT GAAATCACCC CGAGGGTACA GCAGCGTAAA
CCGCCGTTTC CTTTCACGAC ATCGCTTCTG CAGCAGGCCG CGTCAAACCA GCTCGGATTC
GGCTCAAAAA AGACCATGCG TGTCGCGCAG CAGCTCTACG AAGGTATTGA TCTCGGCCCT
GAAGGCGCCA CCGGCCTGAT CACCTATATG AGGACAGACT CTACCAGGGT CAGCGGTGAA
GCTGTAGAAC AGGCTGAACG CTTCATCACG CAGCACTTCG GCCCTGAGTT TACAGGCGGC
GGCCAGGCAG CAAAGAAAGG AAAAAAAACA CAGGACGCTC ATGAAGCTAT CCGGCCAACC
GGGGTTCCCC GTACCCCGGA AACGATGAAA CCGTTTCTTT CTTCGGATCA GTACAAGCTC
TACGAGCTGA TATGGAAAAG GTTTCTGGCT TCACGTATGG CGCCGGCAAA AATTGAACAG
ACACGCGTGG ATGTCGCCGA CCACGAGAAA CAGTTTATCT TTCGCGCAAG CGGAAGCAAC
GTGCTTTTTC CAGGCTTTCT GAAAGTCTAC AACGAGCAGA AAGAACTTGA CTACGAGGCA
CGGAAATCAA CCCGTGAAGA TGAGGAAAAA GAGCAGATAG TCAAACTTCC CCGAAATCTG
ACGGTGAATG AAAAGCTGAC TCTGGATACT CTTGACAAAA AACAGAGTTT CACCCGGCCG
CCCGCACGAT TCACCGAGGC AAGTCTTGTC AAGGAACTTG ATAATTACGG TATCGGCAGA
CCCTCAACCT ATGCAGGCAT CTTCTCGACG CTGCAGGATC GGAGGTATGT GGAATTAGAG
AAGAAAAAAA TCATTCCGAC CGAGTTGGGA AAAGATGTAT CGATGATCCT TGTCGCCAAC
TTTCCTGACC TTTTCAATGT GACGTTTACC GCCCACATGG AAAACGAGCT GGACAAGATT
GCCGCGGGAG ATGATGAGTA TGAACAGGTT CTCGACTCCT TCTACAAGCC TCTTGAATCA
GCTCTGAGCG TCAGAAAAAA CGATCCTCTT CTCCCTCAGA ACACCGATGC GGAAACCTGC
GACAAATGCG GTAAGGGAAA AATGATAATC AAGTGGACCG CCAGCGGTAA ATTTCTCGGG
TGTTCCTCCT ATCCTGCATG CAAAAACATC AAACCGCTCG CATCGTCCAG GGTCCGCCCG
AAGGAAACCG GTATAAAATG CCACGGGTGT GAAAACGGAA GAATGGTTCT GAGAAACGGC
AGGTTCGGCC CGTTCCTTGC CTGCACCGGC TACCCTGACT GCAACACGTT GCTGAAACTT
GACAAACAGC GGAAAATAGA GCCCCCCAAA ACCCCACCGC TTGAAACCGA CCTGGCGTGC
CCGAAATGCG GCGCGCCGCT TTACCTGAGA ACCGGAAAAA GAGGGTTGTG GCTCGGATGC
TCGAAGTTCC CGAAATGCAG AGGGAGACAG GCCTGGGGGC AGCTCGATCC GGCCATACAG
CATCACTGGC AGGAAATTAT GGAAAAACAT CAGGGAGAGC ATCCTTCTGT GACCATTCGC
ATGGTTGACG GCACACCGGT CAACATGCAA CTGACCATCG ATGACATCAT ATCCCAGGCT
GAGGAGAACG GACTGGTGGA GATTGAAAAC AATGAGGAGC AGCCGGTGGG CTCACCAGTC
ACCAGTAGGC AGTAG
 
Protein sequence
MASSPASSSA KGKTLIVVES PSKAKTINKY LGSDYLVFAS VGHIKDLPKK EIGLDFDNNY 
EPRYEVISGK EKVVRQLKKL AGQANDILIA TDPDREGEAI AWHIANEVDA AQKPVHRVLF
NEITKKAILE AINDPLQIDY RLVRSQQTRQ GLDKIVGYKI SPFLWNVVMR GLSAGRVQSV
SLRLICEREA EIDKFEPKEY WTIFADFTTG SGETFSTKLV RIDSNKAEIT NQSDAEKTAS
DILSRIYGVA EITPRVQQRK PPFPFTTSLL QQAASNQLGF GSKKTMRVAQ QLYEGIDLGP
EGATGLITYM RTDSTRVSGE AVEQAERFIT QHFGPEFTGG GQAAKKGKKT QDAHEAIRPT
GVPRTPETMK PFLSSDQYKL YELIWKRFLA SRMAPAKIEQ TRVDVADHEK QFIFRASGSN
VLFPGFLKVY NEQKELDYEA RKSTREDEEK EQIVKLPRNL TVNEKLTLDT LDKKQSFTRP
PARFTEASLV KELDNYGIGR PSTYAGIFST LQDRRYVELE KKKIIPTELG KDVSMILVAN
FPDLFNVTFT AHMENELDKI AAGDDEYEQV LDSFYKPLES ALSVRKNDPL LPQNTDAETC
DKCGKGKMII KWTASGKFLG CSSYPACKNI KPLASSRVRP KETGIKCHGC ENGRMVLRNG
RFGPFLACTG YPDCNTLLKL DKQRKIEPPK TPPLETDLAC PKCGAPLYLR TGKRGLWLGC
SKFPKCRGRQ AWGQLDPAIQ HHWQEIMEKH QGEHPSVTIR MVDGTPVNMQ LTIDDIISQA
EENGLVEIEN NEEQPVGSPV TSRQ