Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0521 |
Symbol | |
ID | 6374185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 546782 |
End bp | 549628 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642683038 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_001958965 |
Protein GI | 189499495 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.276548 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGCAT TCAGCCACAT CAACATACGC GGAGCGAGAG TTCACAATCT CAAGAACATC TCTCTTGATA TTCCGCGAAA CAAATTCGTC GTCATTACGG GAATCTCCGG ATCGGGGAAA TCAAGCCTGG CGTTTGACAC GATCTATGCT GAAGGACAAC GGCGCTTCAT GGAGACCCTT TCAGCCTACG CTCGGCAATA TATCGGCACT ATAGAGCGCC CTGATGTCGA CCTGATAGAA GGGCTCTCCC CGGTTATCGC AATTGACCAG AAAAGCACCA GCCGTTCTCC CCGCTCAACT GTAGGAACAA TTACAGAAAT TCACGATTTC ATCCGTCTGC TCTATGCAAA AGCAGGGAGA TGTCATGATC CCGTCACAGG AGAAGTGCTC CGGAAACAAT CCGAAGATTC GATCACTGAC GCTATTCTTT CCCTTCCTGA AGGAACAAAA GTCTCTATCC TTTCTCCTCT CATAACCGGC AGAAAAGGCC ACTACCGGGA ACTCTTCGAA AGGCTTCTGC AGAAAGGGTT CCTCAGAGTG CGCGTAGACG GACAGTTCAG CGAGATGGAA AAAGGCATGC AGCTCGAACG CTATAAAAGC CATAACATCG AACTGGTAGT CGACAGGTTT GTCATTCAGC ATGAGATCAG AGAACGCCTG AAACAGGCAG TATCCCTTGC CGTCAGCATG TCGGAACATA AATCAGCCGT CATTTGCGCC CCGCTTGAAA GCAACGTGGA AGAGCGGTCA TACAGCACCA AATTAGCTTA CTCGGACGGT TCCGCCCCGC TTGACACGCT GGCCCCGAAC AATTTCAGTT TCAATTCGCC CTATGGAGCG TGCCCGGAGT GCAACGGTCT TGGAGAAATA AAAAATCTTT CTCCCGACCT GATGATACCT GACAGGAATC TTTCCCTGAA CCAGGGAGGT ATCGAACCTT TTGGAAAACC GGGAAAGCGC AATCTCTGGC ACATCATAAA AGCGGTTGCG AAACGATACG GGTTCACTCT GGACACTCAA CTTGCAAAAA TACCATCCGA AGCTCTCGAT ATATTGCTGA AAGGGTCCGG CTCAACTACG TTTGATGTCA CCTACAGCTA TGCAGGAAAA GAGTCAGTGT ATCCGCAGAT ATTTCCGGGC GCTGTTGCTT ACGTCGCAGA GATGCTGAAA AACTCGAACT CGTCGAAAAT AAGGGAGTGG TGTGAAGGGT TCATGCTAAA GCAGCCCTGT CCTGCATGTG GCGGAGCCAG GCTCCGCAAA GAAAGTCTGC ATGTTACAAT TAACGACCTG AACATCCATG AACTCGAGTC CCTGCCGTTA CAGGACACTC TTGATTTCTT TTCTGCTCTT CCCACTCATC TGACGAACAA AGAACGACTT GTCGCCACTC CGATCCTTCA TGAAATAACC AAACGCCTTG AGTTCCTTCT CAACGTCGGG TTAGGCTATC TCAGCCTGAG CAGGGGTTCG CAGACACTTT CAGGAGGCGA AGCCCAGAGA ATCCGGCTGG CGTCTCAACT CGGCTCACAA CTGAGCGGAG TACTCTACGT GCTTGATGAA CCCAGCATCG GTCTGCACCA GCGAGACAAT ATGAAGCTGA TAGAATCGCT GCAGCGGCTG CGGGACATCG GTAATACCGT CCTTGTCGTC GAGCACGATA AAGATACCAT GCTTATGGCC GATGAGATCA TCGATCTCGG ACCCGGCGCA GGTGAACATG GTGGAGAAAT CGTGGTACAT GGCCATGCGT CGAAACTGTC TGAAAATTCC GTAACAGCAG CATACCTCAG AGGTGAGAAA AAGGTTCGCT CGGAGAAAAA CAGCAAAAAT ACAGACGAAA CAAAGGCGAT CACGCTTCAG GGATGCAGGG GAAACAATCT GAAAAATATC GATATCAGGC TACCCCTCGG CACACTCATC TGCGTTACCG GCGTTAGCGG ATCCGGTAAG TCAACCCTGA TAAACGAGAC GCTGCACCCG ATTCTCGCCA GGCATTTTTT CCGTTCAAAA CTGCTCACCC TCCCCTATGA TACTATCGAC GGGATCAGGA ATATCGATAA GGTGGTCAAT GTGGACCAGT CTCCTATAGG AAGAACTCCC CGGTCGAACC CCGCGACCTA TACCGGAGCG TTCACCTTTG TAAGAGAATT TTTCGCGTTG CTTCCCGAAG CACAGATCCG GGGTTACAAG CCGGGAAGGT TCAGCTTCAA TGTCAGGGGG GGGCGTTGCG AGACCTGTCA GGGAGCGGGA ACTAAAAAAA TCGAAATGAA CTTTCTGCCC GATGTCTATG TCCAATGTGA TTCGTGCAAG GGCCGGCGTT ATAACAGGGA AACCCTCCAG GTGAAGTACA ACGGTAAATC CATTGCAGAC GTTCTGGACA TGACCGTCGA GGATGCTGTG GCGTTCTTTT CGGACTTCCC CCGTATAAAA CGCATCCTCT CAACCATGCA GAGCGTGGGG CTGGGATATA TTAAACTTGG CCAGCCCTCT CCACTGCTTT CAGGAGGAGA AGCGCAGCGG ATTAAACTCT CCGCCGAACT TGCTAAAGTA CAAACAGGCA AAACCCTCTA CATTCTCGAT GAACCCACAA CAGGCCTGCA CTTCCAGGAT ATCCAGCATC TGCTCGATGT TCTGCAGAGA CTTGTCAACA AAGGCAATAC CGTTATTATT ATCGAGCACA ACCTCGATAT CATCAAGAAC GCTGACTGGA TAATCGACCT CGGACCTGAA GGTGGAGAGA AAGGAGGGCA ACTTGTCGCA GAAGGCACTC CCGCAGAGAT TGCCGGGAAT TCAGGATCAT ATACCGGACA ATTCCTTGCA GAAGAACTGA AGAAGTTCGA TAGCTGA
|
Protein sequence | MSAFSHINIR GARVHNLKNI SLDIPRNKFV VITGISGSGK SSLAFDTIYA EGQRRFMETL SAYARQYIGT IERPDVDLIE GLSPVIAIDQ KSTSRSPRST VGTITEIHDF IRLLYAKAGR CHDPVTGEVL RKQSEDSITD AILSLPEGTK VSILSPLITG RKGHYRELFE RLLQKGFLRV RVDGQFSEME KGMQLERYKS HNIELVVDRF VIQHEIRERL KQAVSLAVSM SEHKSAVICA PLESNVEERS YSTKLAYSDG SAPLDTLAPN NFSFNSPYGA CPECNGLGEI KNLSPDLMIP DRNLSLNQGG IEPFGKPGKR NLWHIIKAVA KRYGFTLDTQ LAKIPSEALD ILLKGSGSTT FDVTYSYAGK ESVYPQIFPG AVAYVAEMLK NSNSSKIREW CEGFMLKQPC PACGGARLRK ESLHVTINDL NIHELESLPL QDTLDFFSAL PTHLTNKERL VATPILHEIT KRLEFLLNVG LGYLSLSRGS QTLSGGEAQR IRLASQLGSQ LSGVLYVLDE PSIGLHQRDN MKLIESLQRL RDIGNTVLVV EHDKDTMLMA DEIIDLGPGA GEHGGEIVVH GHASKLSENS VTAAYLRGEK KVRSEKNSKN TDETKAITLQ GCRGNNLKNI DIRLPLGTLI CVTGVSGSGK STLINETLHP ILARHFFRSK LLTLPYDTID GIRNIDKVVN VDQSPIGRTP RSNPATYTGA FTFVREFFAL LPEAQIRGYK PGRFSFNVRG GRCETCQGAG TKKIEMNFLP DVYVQCDSCK GRRYNRETLQ VKYNGKSIAD VLDMTVEDAV AFFSDFPRIK RILSTMQSVG LGYIKLGQPS PLLSGGEAQR IKLSAELAKV QTGKTLYILD EPTTGLHFQD IQHLLDVLQR LVNKGNTVII IEHNLDIIKN ADWIIDLGPE GGEKGGQLVA EGTPAEIAGN SGSYTGQFLA EELKKFDS
|
| |