Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0780 |
Symbol | |
ID | 6374447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 834631 |
End bp | 837477 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642683288 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_001959212 |
Protein GI | 189499742 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0344359 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.338126 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAAAA AGAGAGCTGT TTCGGGAGAA GACCTGTCCC TTCCCGATAT TGTTATGAAA GGAGTCAGTA CACATAACCT GAAGAATATA TCTGTACGGA TTCCCAGAAA TAAATTTGTC GTCATTACCG GGGTCAGCGG TTCGGGGAAA TCAAGTCTTG CCTTCGATAC TCTTTACGCT GAAGGACATC GAAGGTATGT TGAGTCACTC TCCGCATATA TTCGCCAGTT TCTGGAACGA ATGCCCAGGC CGGATATCGC ATCGATAGAA GGAATCGCCC CCGCGATTGC CATTGAACAA AAAGCTATCC CCAAAAATCC GCGTTCGACA GTAGGTACCG TTTCGGAAAT CTATGATTAT CTCAGGTTGC TTTTTGCCCG GATAGGAAAG ATCTATTCTG AGGATACCAA TGAGCTCGTG CTGAAACACG CGCCTGAAGA TGTCGGCATT CAGGCAGACT TTCTCGAGAA AGGAACCCGT TTTTTTGTCG GTTTTCCCTT TCCGTGTCAT ACAGATGTTG CCCGGCACCG TTGTCCGGTT GATGAGGAAT TGCAGAATCT TCTGCAGAAA GGCTTTTTTC GTCTGATATA CCAGGACAAG GTGCTCGACA TCAATGACGT TTCGGTTCGT GAGCGCATTG CCGGTATGCG TGCCGATGAG ATATCGGAGG TTCTTGTACT TGTCGACCGG TTCAAGGCTG TTGGAGATGA AAAAACAATG AGCAGGGTGT CCCAGGCGGC TGAAATCAGT TTCAACGAGT CGAGCGGATA CGCCGTGCTG AAAGTTGCAG GCGGCAAGAC CTTTCGTTTC AGCGACCGTC TTGAATTGAA CGGCGTTGAA TATCAGGATC CTGCACCGCA GCTTTTTGCC TTTAATTCCC CGCTCGGAGC ATGTCCGGAG TGTCAGGGTT TCGGAAGACT TGCAGGCATT GACGAAGATG CGGTAGTGCC GAACAGATCG TTGAGTCTTG CAGAAGGGGC CATTGCATGC TGGAACTCGG AGAAGTACCG CAGACATCTC AGAAAGCTGC TTGAGATCGC CCGGGAGGCC GGGATTCCTG TTGACCGGCC CTACGAGAAG CTGTCCCATG TTCATAAGGA TCTCATCTGG AAGGGCATAA AACGGAAGGG ATACAAGGGT ATCCGGCCTT TTTTTGCAGA AATAGAAAAG GACGCGGGAT ATAAAATGCA TCTGCGGGTT TTTCTCAGCC GATACAGGGG ATATGCTGTC TGTACTGCTT GTGAAGGTAG CAGGGTAAAG CCGGAAGCGA GGTGTGTGCG GGTTTCCGGT AAAAACATCG GTGAAGTCAG CAGGATGAAC CTTGCGGAAG CTCACGGTTT TTTCAGTGAT CTCGCTATAT CTCCATTCGA CAGAAAGGTT GCGGGAGCTG TCCTGCTTGA AATTCAGAAA CGCCTGAGAT ATATGCTCGA TGTCGGTCTT GACTATCTGA CTCTTGACCG GCTGACCCAT ACGTTGAGCG GAGGGGAGTT TCAGCGGATC AACCTCTCAA CCTCTCTCGG ATCACCTCTT GTCGGAGCGA TGTATATTCT TGACGAACCA AGTATCGGCC TGCATCAGAG CGACTCGGCA CGGTTGATCG GTTTGTTGAA GCGGTTACGT GATCTTGGAA ATACGGTGAT TGTTGTCGAG CATGACAGGG AGATTATGGA AGAGGCGGAC GAAATAATAG ATCTTGGCCC GAAAGCCGGA AGGATGGGAG GGGAGGTTGT TTTTCATGGA ACGCCTGACG CTCTGCTCGA AACCGGAAAT TCTCTCACGG CAGAGTATCT TACCGGAAGA AAAATCATAC CTGTTCCATC AAAAAGGCGT GAGCCTGATT TTTCACGATG CATCGTGGTC ACCGGCGCCA TGCAGAACAA TCTCAAGAGT ATCGATGTCC GGTTTCCACT GGGGATCATG ACCTGTGTGA CCGGTGTCAG CGGCTCGGGT AAGTCAACGC TTGTCAATGA TATTCTTAAC AAAGGGATTG TCCGGGCAAA AGAACATTCA GGAGAAAAAG CCGGAACCCA CCGTCTTATT ACCGGAACGG AGCTGGTGCA AGCTGTTGAG CATGTAGACC AGTCACCGAT CGGCAAGTCA AGCAGAAGCA ACCCTGTGAC CTATCTGAAG ATTTTTGATG ATATCCGGAG CCTGTTTTCC CGGACAAGAG ACGCCAGATC AAGAGGATGG AAACAGGGAT ACTTTTCATT TAATATTCCT GGTGGTCGTT GTGAAGCCTG TGCCGGAGAA GGTACAGTCC GCATTGAGAT GCAGTTTCTG GCCGATATCG AAGCCGTATG CGAAGAGTGC GGGGGTAAAC GCTATAAAAG CGATACACTT GATATTCGCT TCAAGGGATT ATCCATCTCT GACGTTCTGG AGCTCACTGT GGAGGAGGCT CTGGATGTTT TTTCTTCTGA AAAAAACATT CTTCGCAAGC TCAAAGTTCT CGATGAGGTC GGGCTTGGCT ACATCCGTCT GGGCCAGTCA TCCAACACGC TTTCCGGAGG AGAAGCGCAG CGGCTCAAGC TGGCTTTTTT TATCGCGAAG GCTGATGTGG AACACACGCT CTTTATTTTT GACGAACCGA CGACAGGGCT TCATTTTGAG GATATTCTTA AACTGATTGA CTGTTTTGAA CGGCTTCTGG CACAGAACAA CTCGCTGGTG ATCATTGAGC ACAATCCGGA CATTATTAAA CAGGCCGACT GGGTGATTGA TCTCGGCCCC GGCGCCGGAG ACAAGGGTGG GGAAGTTGTT GCAGAAGGAA CGCCTGAATC GATATGCGGA AATTCAGCGT CTCTTACCGG ACTTCATCTG AAGCCCTGGC TTGAAGGAGG GGAGTGA
|
Protein sequence | MQKKRAVSGE DLSLPDIVMK GVSTHNLKNI SVRIPRNKFV VITGVSGSGK SSLAFDTLYA EGHRRYVESL SAYIRQFLER MPRPDIASIE GIAPAIAIEQ KAIPKNPRST VGTVSEIYDY LRLLFARIGK IYSEDTNELV LKHAPEDVGI QADFLEKGTR FFVGFPFPCH TDVARHRCPV DEELQNLLQK GFFRLIYQDK VLDINDVSVR ERIAGMRADE ISEVLVLVDR FKAVGDEKTM SRVSQAAEIS FNESSGYAVL KVAGGKTFRF SDRLELNGVE YQDPAPQLFA FNSPLGACPE CQGFGRLAGI DEDAVVPNRS LSLAEGAIAC WNSEKYRRHL RKLLEIAREA GIPVDRPYEK LSHVHKDLIW KGIKRKGYKG IRPFFAEIEK DAGYKMHLRV FLSRYRGYAV CTACEGSRVK PEARCVRVSG KNIGEVSRMN LAEAHGFFSD LAISPFDRKV AGAVLLEIQK RLRYMLDVGL DYLTLDRLTH TLSGGEFQRI NLSTSLGSPL VGAMYILDEP SIGLHQSDSA RLIGLLKRLR DLGNTVIVVE HDREIMEEAD EIIDLGPKAG RMGGEVVFHG TPDALLETGN SLTAEYLTGR KIIPVPSKRR EPDFSRCIVV TGAMQNNLKS IDVRFPLGIM TCVTGVSGSG KSTLVNDILN KGIVRAKEHS GEKAGTHRLI TGTELVQAVE HVDQSPIGKS SRSNPVTYLK IFDDIRSLFS RTRDARSRGW KQGYFSFNIP GGRCEACAGE GTVRIEMQFL ADIEAVCEEC GGKRYKSDTL DIRFKGLSIS DVLELTVEEA LDVFSSEKNI LRKLKVLDEV GLGYIRLGQS SNTLSGGEAQ RLKLAFFIAK ADVEHTLFIF DEPTTGLHFE DILKLIDCFE RLLAQNNSLV IIEHNPDIIK QADWVIDLGP GAGDKGGEVV AEGTPESICG NSASLTGLHL KPWLEGGE
|
| |