Gene Cphamn1_0780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0780 
Symbol 
ID6374447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp834631 
End bp837477 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content50% 
IMG OID642683288 
Productexcinuclease ABC, A subunit 
Protein accessionYP_001959212 
Protein GI189499742 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0344359 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.338126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAAAA AGAGAGCTGT TTCGGGAGAA GACCTGTCCC TTCCCGATAT TGTTATGAAA 
GGAGTCAGTA CACATAACCT GAAGAATATA TCTGTACGGA TTCCCAGAAA TAAATTTGTC
GTCATTACCG GGGTCAGCGG TTCGGGGAAA TCAAGTCTTG CCTTCGATAC TCTTTACGCT
GAAGGACATC GAAGGTATGT TGAGTCACTC TCCGCATATA TTCGCCAGTT TCTGGAACGA
ATGCCCAGGC CGGATATCGC ATCGATAGAA GGAATCGCCC CCGCGATTGC CATTGAACAA
AAAGCTATCC CCAAAAATCC GCGTTCGACA GTAGGTACCG TTTCGGAAAT CTATGATTAT
CTCAGGTTGC TTTTTGCCCG GATAGGAAAG ATCTATTCTG AGGATACCAA TGAGCTCGTG
CTGAAACACG CGCCTGAAGA TGTCGGCATT CAGGCAGACT TTCTCGAGAA AGGAACCCGT
TTTTTTGTCG GTTTTCCCTT TCCGTGTCAT ACAGATGTTG CCCGGCACCG TTGTCCGGTT
GATGAGGAAT TGCAGAATCT TCTGCAGAAA GGCTTTTTTC GTCTGATATA CCAGGACAAG
GTGCTCGACA TCAATGACGT TTCGGTTCGT GAGCGCATTG CCGGTATGCG TGCCGATGAG
ATATCGGAGG TTCTTGTACT TGTCGACCGG TTCAAGGCTG TTGGAGATGA AAAAACAATG
AGCAGGGTGT CCCAGGCGGC TGAAATCAGT TTCAACGAGT CGAGCGGATA CGCCGTGCTG
AAAGTTGCAG GCGGCAAGAC CTTTCGTTTC AGCGACCGTC TTGAATTGAA CGGCGTTGAA
TATCAGGATC CTGCACCGCA GCTTTTTGCC TTTAATTCCC CGCTCGGAGC ATGTCCGGAG
TGTCAGGGTT TCGGAAGACT TGCAGGCATT GACGAAGATG CGGTAGTGCC GAACAGATCG
TTGAGTCTTG CAGAAGGGGC CATTGCATGC TGGAACTCGG AGAAGTACCG CAGACATCTC
AGAAAGCTGC TTGAGATCGC CCGGGAGGCC GGGATTCCTG TTGACCGGCC CTACGAGAAG
CTGTCCCATG TTCATAAGGA TCTCATCTGG AAGGGCATAA AACGGAAGGG ATACAAGGGT
ATCCGGCCTT TTTTTGCAGA AATAGAAAAG GACGCGGGAT ATAAAATGCA TCTGCGGGTT
TTTCTCAGCC GATACAGGGG ATATGCTGTC TGTACTGCTT GTGAAGGTAG CAGGGTAAAG
CCGGAAGCGA GGTGTGTGCG GGTTTCCGGT AAAAACATCG GTGAAGTCAG CAGGATGAAC
CTTGCGGAAG CTCACGGTTT TTTCAGTGAT CTCGCTATAT CTCCATTCGA CAGAAAGGTT
GCGGGAGCTG TCCTGCTTGA AATTCAGAAA CGCCTGAGAT ATATGCTCGA TGTCGGTCTT
GACTATCTGA CTCTTGACCG GCTGACCCAT ACGTTGAGCG GAGGGGAGTT TCAGCGGATC
AACCTCTCAA CCTCTCTCGG ATCACCTCTT GTCGGAGCGA TGTATATTCT TGACGAACCA
AGTATCGGCC TGCATCAGAG CGACTCGGCA CGGTTGATCG GTTTGTTGAA GCGGTTACGT
GATCTTGGAA ATACGGTGAT TGTTGTCGAG CATGACAGGG AGATTATGGA AGAGGCGGAC
GAAATAATAG ATCTTGGCCC GAAAGCCGGA AGGATGGGAG GGGAGGTTGT TTTTCATGGA
ACGCCTGACG CTCTGCTCGA AACCGGAAAT TCTCTCACGG CAGAGTATCT TACCGGAAGA
AAAATCATAC CTGTTCCATC AAAAAGGCGT GAGCCTGATT TTTCACGATG CATCGTGGTC
ACCGGCGCCA TGCAGAACAA TCTCAAGAGT ATCGATGTCC GGTTTCCACT GGGGATCATG
ACCTGTGTGA CCGGTGTCAG CGGCTCGGGT AAGTCAACGC TTGTCAATGA TATTCTTAAC
AAAGGGATTG TCCGGGCAAA AGAACATTCA GGAGAAAAAG CCGGAACCCA CCGTCTTATT
ACCGGAACGG AGCTGGTGCA AGCTGTTGAG CATGTAGACC AGTCACCGAT CGGCAAGTCA
AGCAGAAGCA ACCCTGTGAC CTATCTGAAG ATTTTTGATG ATATCCGGAG CCTGTTTTCC
CGGACAAGAG ACGCCAGATC AAGAGGATGG AAACAGGGAT ACTTTTCATT TAATATTCCT
GGTGGTCGTT GTGAAGCCTG TGCCGGAGAA GGTACAGTCC GCATTGAGAT GCAGTTTCTG
GCCGATATCG AAGCCGTATG CGAAGAGTGC GGGGGTAAAC GCTATAAAAG CGATACACTT
GATATTCGCT TCAAGGGATT ATCCATCTCT GACGTTCTGG AGCTCACTGT GGAGGAGGCT
CTGGATGTTT TTTCTTCTGA AAAAAACATT CTTCGCAAGC TCAAAGTTCT CGATGAGGTC
GGGCTTGGCT ACATCCGTCT GGGCCAGTCA TCCAACACGC TTTCCGGAGG AGAAGCGCAG
CGGCTCAAGC TGGCTTTTTT TATCGCGAAG GCTGATGTGG AACACACGCT CTTTATTTTT
GACGAACCGA CGACAGGGCT TCATTTTGAG GATATTCTTA AACTGATTGA CTGTTTTGAA
CGGCTTCTGG CACAGAACAA CTCGCTGGTG ATCATTGAGC ACAATCCGGA CATTATTAAA
CAGGCCGACT GGGTGATTGA TCTCGGCCCC GGCGCCGGAG ACAAGGGTGG GGAAGTTGTT
GCAGAAGGAA CGCCTGAATC GATATGCGGA AATTCAGCGT CTCTTACCGG ACTTCATCTG
AAGCCCTGGC TTGAAGGAGG GGAGTGA
 
Protein sequence
MQKKRAVSGE DLSLPDIVMK GVSTHNLKNI SVRIPRNKFV VITGVSGSGK SSLAFDTLYA 
EGHRRYVESL SAYIRQFLER MPRPDIASIE GIAPAIAIEQ KAIPKNPRST VGTVSEIYDY
LRLLFARIGK IYSEDTNELV LKHAPEDVGI QADFLEKGTR FFVGFPFPCH TDVARHRCPV
DEELQNLLQK GFFRLIYQDK VLDINDVSVR ERIAGMRADE ISEVLVLVDR FKAVGDEKTM
SRVSQAAEIS FNESSGYAVL KVAGGKTFRF SDRLELNGVE YQDPAPQLFA FNSPLGACPE
CQGFGRLAGI DEDAVVPNRS LSLAEGAIAC WNSEKYRRHL RKLLEIAREA GIPVDRPYEK
LSHVHKDLIW KGIKRKGYKG IRPFFAEIEK DAGYKMHLRV FLSRYRGYAV CTACEGSRVK
PEARCVRVSG KNIGEVSRMN LAEAHGFFSD LAISPFDRKV AGAVLLEIQK RLRYMLDVGL
DYLTLDRLTH TLSGGEFQRI NLSTSLGSPL VGAMYILDEP SIGLHQSDSA RLIGLLKRLR
DLGNTVIVVE HDREIMEEAD EIIDLGPKAG RMGGEVVFHG TPDALLETGN SLTAEYLTGR
KIIPVPSKRR EPDFSRCIVV TGAMQNNLKS IDVRFPLGIM TCVTGVSGSG KSTLVNDILN
KGIVRAKEHS GEKAGTHRLI TGTELVQAVE HVDQSPIGKS SRSNPVTYLK IFDDIRSLFS
RTRDARSRGW KQGYFSFNIP GGRCEACAGE GTVRIEMQFL ADIEAVCEEC GGKRYKSDTL
DIRFKGLSIS DVLELTVEEA LDVFSSEKNI LRKLKVLDEV GLGYIRLGQS SNTLSGGEAQ
RLKLAFFIAK ADVEHTLFIF DEPTTGLHFE DILKLIDCFE RLLAQNNSLV IIEHNPDIIK
QADWVIDLGP GAGDKGGEVV AEGTPESICG NSASLTGLHL KPWLEGGE