Gene Cphamn1_0521 details

Gene Information

Locus tag: Cphamn1_0521
Symbol:
ID: 6374185
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 546782
End bp: 549628
Gene Length: 2847 bp
Protein Length: 948 aa
Translation table: 11
GC content: 50%
IMG OID: 642683038
Product: excinuclease ABC, A subunit
Protein accession: YP_001958965
Protein GI: 189499495
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
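
The start, end, gene length and protein length fields in this record agree with each other, which a little arithmetic confirms. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record.
start_bp = 546782
end_bp = 549628

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 2847
print(protein_length)  # 948
```

Both results match the Gene Length (2847 bp) and Protein Length (948 aa) fields.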


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.276548
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGCAT TCAGCCACAT CAACATACGC GGAGCGAGAG TTCACAATCT CAAGAACATC 
TCTCTTGATA TTCCGCGAAA CAAATTCGTC GTCATTACGG GAATCTCCGG ATCGGGGAAA
TCAAGCCTGG CGTTTGACAC GATCTATGCT GAAGGACAAC GGCGCTTCAT GGAGACCCTT
TCAGCCTACG CTCGGCAATA TATCGGCACT ATAGAGCGCC CTGATGTCGA CCTGATAGAA
GGGCTCTCCC CGGTTATCGC AATTGACCAG AAAAGCACCA GCCGTTCTCC CCGCTCAACT
GTAGGAACAA TTACAGAAAT TCACGATTTC ATCCGTCTGC TCTATGCAAA AGCAGGGAGA
TGTCATGATC CCGTCACAGG AGAAGTGCTC CGGAAACAAT CCGAAGATTC GATCACTGAC
GCTATTCTTT CCCTTCCTGA AGGAACAAAA GTCTCTATCC TTTCTCCTCT CATAACCGGC
AGAAAAGGCC ACTACCGGGA ACTCTTCGAA AGGCTTCTGC AGAAAGGGTT CCTCAGAGTG
CGCGTAGACG GACAGTTCAG CGAGATGGAA AAAGGCATGC AGCTCGAACG CTATAAAAGC
CATAACATCG AACTGGTAGT CGACAGGTTT GTCATTCAGC ATGAGATCAG AGAACGCCTG
AAACAGGCAG TATCCCTTGC CGTCAGCATG TCGGAACATA AATCAGCCGT CATTTGCGCC
CCGCTTGAAA GCAACGTGGA AGAGCGGTCA TACAGCACCA AATTAGCTTA CTCGGACGGT
TCCGCCCCGC TTGACACGCT GGCCCCGAAC AATTTCAGTT TCAATTCGCC CTATGGAGCG
TGCCCGGAGT GCAACGGTCT TGGAGAAATA AAAAATCTTT CTCCCGACCT GATGATACCT
GACAGGAATC TTTCCCTGAA CCAGGGAGGT ATCGAACCTT TTGGAAAACC GGGAAAGCGC
AATCTCTGGC ACATCATAAA AGCGGTTGCG AAACGATACG GGTTCACTCT GGACACTCAA
CTTGCAAAAA TACCATCCGA AGCTCTCGAT ATATTGCTGA AAGGGTCCGG CTCAACTACG
TTTGATGTCA CCTACAGCTA TGCAGGAAAA GAGTCAGTGT ATCCGCAGAT ATTTCCGGGC
GCTGTTGCTT ACGTCGCAGA GATGCTGAAA AACTCGAACT CGTCGAAAAT AAGGGAGTGG
TGTGAAGGGT TCATGCTAAA GCAGCCCTGT CCTGCATGTG GCGGAGCCAG GCTCCGCAAA
GAAAGTCTGC ATGTTACAAT TAACGACCTG AACATCCATG AACTCGAGTC CCTGCCGTTA
CAGGACACTC TTGATTTCTT TTCTGCTCTT CCCACTCATC TGACGAACAA AGAACGACTT
GTCGCCACTC CGATCCTTCA TGAAATAACC AAACGCCTTG AGTTCCTTCT CAACGTCGGG
TTAGGCTATC TCAGCCTGAG CAGGGGTTCG CAGACACTTT CAGGAGGCGA AGCCCAGAGA
ATCCGGCTGG CGTCTCAACT CGGCTCACAA CTGAGCGGAG TACTCTACGT GCTTGATGAA
CCCAGCATCG GTCTGCACCA GCGAGACAAT ATGAAGCTGA TAGAATCGCT GCAGCGGCTG
CGGGACATCG GTAATACCGT CCTTGTCGTC GAGCACGATA AAGATACCAT GCTTATGGCC
GATGAGATCA TCGATCTCGG ACCCGGCGCA GGTGAACATG GTGGAGAAAT CGTGGTACAT
GGCCATGCGT CGAAACTGTC TGAAAATTCC GTAACAGCAG CATACCTCAG AGGTGAGAAA
AAGGTTCGCT CGGAGAAAAA CAGCAAAAAT ACAGACGAAA CAAAGGCGAT CACGCTTCAG
GGATGCAGGG GAAACAATCT GAAAAATATC GATATCAGGC TACCCCTCGG CACACTCATC
TGCGTTACCG GCGTTAGCGG ATCCGGTAAG TCAACCCTGA TAAACGAGAC GCTGCACCCG
ATTCTCGCCA GGCATTTTTT CCGTTCAAAA CTGCTCACCC TCCCCTATGA TACTATCGAC
GGGATCAGGA ATATCGATAA GGTGGTCAAT GTGGACCAGT CTCCTATAGG AAGAACTCCC
CGGTCGAACC CCGCGACCTA TACCGGAGCG TTCACCTTTG TAAGAGAATT TTTCGCGTTG
CTTCCCGAAG CACAGATCCG GGGTTACAAG CCGGGAAGGT TCAGCTTCAA TGTCAGGGGG
GGGCGTTGCG AGACCTGTCA GGGAGCGGGA ACTAAAAAAA TCGAAATGAA CTTTCTGCCC
GATGTCTATG TCCAATGTGA TTCGTGCAAG GGCCGGCGTT ATAACAGGGA AACCCTCCAG
GTGAAGTACA ACGGTAAATC CATTGCAGAC GTTCTGGACA TGACCGTCGA GGATGCTGTG
GCGTTCTTTT CGGACTTCCC CCGTATAAAA CGCATCCTCT CAACCATGCA GAGCGTGGGG
CTGGGATATA TTAAACTTGG CCAGCCCTCT CCACTGCTTT CAGGAGGAGA AGCGCAGCGG
ATTAAACTCT CCGCCGAACT TGCTAAAGTA CAAACAGGCA AAACCCTCTA CATTCTCGAT
GAACCCACAA CAGGCCTGCA CTTCCAGGAT ATCCAGCATC TGCTCGATGT TCTGCAGAGA
CTTGTCAACA AAGGCAATAC CGTTATTATT ATCGAGCACA ACCTCGATAT CATCAAGAAC
GCTGACTGGA TAATCGACCT CGGACCTGAA GGTGGAGAGA AAGGAGGGCA ACTTGTCGCA
GAAGGCACTC CCGCAGAGAT TGCCGGGAAT TCAGGATCAT ATACCGGACA ATTCCTTGCA
GAAGAACTGA AGAAGTTCGA TAGCTGA
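
The gene sequence is printed in blocks of 10 bases, six blocks per line, so it has to be joined before it can be measured. The sketch below shows one way to do that and to compute a GC percentage; `join_blocks` and `gc_percent` are hypothetical helper names, and only the first and last lines of the sequence are used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTCAGCAT TCAGCCACAT CAACATACGC GGAGCGAGAG TTCACAATCT CAAGAACATC"
last_line = "GAAGAACTGA AGAAGTTCGA TAGCTGA"

print(len(join_blocks(first_line)))  # 60
print(len(join_blocks(last_line)))   # 27

# GC percentage of the first line only, as a usage example.
print(round(gc_percent(join_blocks(first_line)), 1))
```

The listing has 47 full lines of 60 bases plus a final line of 27, which gives 47 × 60 + 27 = 2847 bases, the same as the Gene Length field. Passing the whole joined sequence to `gc_percent` would be the way to compare against the reported GC content of 50%.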
 
Protein sequence
MSAFSHINIR GARVHNLKNI SLDIPRNKFV VITGISGSGK SSLAFDTIYA EGQRRFMETL 
SAYARQYIGT IERPDVDLIE GLSPVIAIDQ KSTSRSPRST VGTITEIHDF IRLLYAKAGR
CHDPVTGEVL RKQSEDSITD AILSLPEGTK VSILSPLITG RKGHYRELFE RLLQKGFLRV
RVDGQFSEME KGMQLERYKS HNIELVVDRF VIQHEIRERL KQAVSLAVSM SEHKSAVICA
PLESNVEERS YSTKLAYSDG SAPLDTLAPN NFSFNSPYGA CPECNGLGEI KNLSPDLMIP
DRNLSLNQGG IEPFGKPGKR NLWHIIKAVA KRYGFTLDTQ LAKIPSEALD ILLKGSGSTT
FDVTYSYAGK ESVYPQIFPG AVAYVAEMLK NSNSSKIREW CEGFMLKQPC PACGGARLRK
ESLHVTINDL NIHELESLPL QDTLDFFSAL PTHLTNKERL VATPILHEIT KRLEFLLNVG
LGYLSLSRGS QTLSGGEAQR IRLASQLGSQ LSGVLYVLDE PSIGLHQRDN MKLIESLQRL
RDIGNTVLVV EHDKDTMLMA DEIIDLGPGA GEHGGEIVVH GHASKLSENS VTAAYLRGEK
KVRSEKNSKN TDETKAITLQ GCRGNNLKNI DIRLPLGTLI CVTGVSGSGK STLINETLHP
ILARHFFRSK LLTLPYDTID GIRNIDKVVN VDQSPIGRTP RSNPATYTGA FTFVREFFAL
LPEAQIRGYK PGRFSFNVRG GRCETCQGAG TKKIEMNFLP DVYVQCDSCK GRRYNRETLQ
VKYNGKSIAD VLDMTVEDAV AFFSDFPRIK RILSTMQSVG LGYIKLGQPS PLLSGGEAQR
IKLSAELAKV QTGKTLYILD EPTTGLHFQD IQHLLDVLQR LVNKGNTVII IEHNLDIIKN
ADWIIDLGPE GGEKGGQLVA EGTPAEIAGN SGSYTGQFLA EELKKFDS
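
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is a minimal illustration, not the database's own method: it builds a codon table in TCAG order, using the fact that translation table 11 assigns the same amino acids as the standard code (the two differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The `translate` function is a hypothetical helper.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the last line of the gene sequence from the record.
print(translate("ATGTCAGCAT TCAGCCACAT CAACATACGC"))  # MSAFSHINIR
print(translate("GAAGAACTGA AGAAGTTCGA TAGCTGA"))     # EELKKFDS
```

The two outputs match the first ten and last eight residues of the protein sequence. The last line of the gene listing can be translated on its own because the 2820 bases before it are a multiple of three, so it begins in frame.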