Gene Information |
Locus tag | Cag_0154 |
Symbol | |
ID | 3747719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 171329 |
End bp | 174184 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637772681 |
Product | excinuclease ABC subunit A |
Protein accession | YP_378475 |
Protein GI | 78188137 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
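The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 171329
END_BP = 174184


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not counted.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2856, matching "Gene Length"
print(protein_length(length_bp))  # 951, matching "Protein Length"
```

Under these assumptions, 174184 - 171329 + 1 = 2856 bp, which is 952 codons: 951 amino acids plus one stop codon.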
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.676306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATTCA GCCACATTAG CATACGAGGC GCCCGCGTTC ATAACCTCAA GAACATTTCG CTTGATATTC CCCGCAACCA ATTTGTGGTT ATTACAGGGC TTTCAGGTTC AGGAAAATCG AGTCTTGCTT TCGACACCAT TTATGCCGAA GGGCAGCGCC GCTTTATGGA AACGCTCTCC CCTTATGCAC GCCAATATAT TGGCAACATT GAGCGCCCTG ATGTGGATTT TATTGAAGGA CTGTCGCCCG TTATTGCTAT TGATCAAAAA AGTACCAGCC GCTCCCCTCG CTCAACGGTT GGCACTATTA CCGAAATCCA CGACTTTATT CGGTTGCTGT ATGCAAAAGC GGGACGCCGT TACAATCCCG AAACGGGTGC CATGGTGCAA GCACAAAGCG CCGACAACAT TCTTGCAACC ATTCTTGCCC TACCCGAAGG AAGCAAGGTG CAAATTCTTT CACCACTTGT TACAGGGCGA AAAGGGCATT ATCGGGAGCT ATTTGAGCGC TTACGCAGCA AAGGCTTTTT GCGGGTGCGT GTTGATGGCG AATTGCAAGA AATGGTGCCC AACATGCAGC TTGAGCGTTA CAAAAGCCAC ACCATTGAGT TGGTGGTTGA TCGGCTTGTT CTTGCGCCTG AAAGCGAAGC ACGAGTGCGC GAAGCCGTCA TGCTGGCTAT TAGTATCTCG GAGCACAAGT CGTCGGTTAT TTGCACGCCC TTTGAGGGTG GCTTTACCGA GCTTGCTTTT ACGCTCAGCA AAGGGGATAA TGAGGATGCC CTGCCAACAT CAACCTTGGC ACCGAACCAC TTTAGCTTTA ATTCCCCTTA TGGCGCTTGC CCAACCTGTA ACGGATTGGG TGAATTGATG CAGCTTTCGG GTGAATTGAT GATTCCCGAT CCTTCGCTGT CGCTCAATCA AGGTGGGCTT GACCCTTTTG GAAAAGCTGG CAAACGCAAC CATTGGCAGG TAATTCGCGC TATTGCAAAA GAGTTCGATT TTACGCTCGA TACTCCCATG AGCAAAATTC CCAAAAGCGC ACTTAAAATA TTGCTCAATG GCTCAGGCAA GCGCACCTTT GAGGTAGCTT ACACCTCTTC AGGACACACC AGCTTATATC CACAGCCTTT TCAAGGTGCC GTAGCATATG TGCAAGAAAT TCTCAATAAC GCCACAACCT CGAAAGTGCG GGAGTGGGCT GAAGCCTACA TGCTCCACCA ACCCTGCCCC GTATGCCTTG GCGCACGCTT AAAACCCGAA AGCTTGCAGG TTAAAATTCA TGGCTTAAAC ATTGCTGAAC TCGAAGCTTT GCCACTACCT GAAACCCTTG CCTTTTTTAA TAATCTACCG CCCAATCTTA GCCAAAAAGA GTTGATAATT GCCACTCCCG TGTTGCATGA AATCACCAAA CGGCTCCAAT TTTTATTGGA TGTTGGGTTA GGCTATCTCT CGCTTGACCG TAGCTCGCAC ACACTTTCGG GCGGCGAAGC ACAGCGCATT CGGCTTGCCT CGCAGCTTGG CTCGCAACTG AGCGGCGTGC TCTATGTGCT TGACGAGCCG AGTATTGGAT TGCATCAGCG CGACAACCAC AAGCTCATTA CCTCATTGAA GCATTTGCGC GACCTTGGCA ACACCGTGTT AGTGGTTGAG CACGATAAAG ATACCATGCT GGAAGCTGAT ACCATTGTGG ATCTTGGTCC GGGTGCGGGC GCTTACGGAG GCGAAATTGT GGCTTTTGGC GCAGCCCGTG AGCTTGACCC TTCGTCGCTA ACGGCAGGCT ACCTCAATGG CACCAACCGC 
GTTTTTTATG CAAGCGAAGC TTCATCCGAA AAAACTGATG CCGATGCCGA TGCCACACCA CTTTTTCTTA CGCTGAAAGG ATGTAAAGGC AACAATCTTA AAAACATTGA CGCACAAATT CCGCTCCGCA AATTAGTAAG CATTACGGGT GTAAGTGGCT CAGGTAAATC AACCTTGATT AATGAAACCC TTTACCCAAT CCTTGCACGC CACTTCTACC GCTCAAAAGT AGTAACCGCA CCATTCGACG CTATTGAAGG GATAGAGCTG CTTGACAAGG TGGTAAATGT TGACCAATCA CCCATTGGAC GCACACCGCG CTCCAATCCC GCAACCTACA CGGGAGCCTT TACCTTTATT CGCGACTTCT TTACCCGCTT GCCCGAAGCG CAAATTCGTG GCTACAAAGC GGGACGTTTT AGCTTTAACG TAAAAGGGGG GCGCTGCGAA GTGTGCCAAG GCGCAGGCAC GCGCAAAATT GAGATGAATT TTTTGCCCGA CGTTTACGTG CAGTGCGAAA ATTGCAAAGG CGAACGCTAC AACCGCGAAA CGCTGATGGT AAAGTATCGC GGTAAATCCA TTGCCGACGT ATTGGAAATG AGCATTACCG AAGCCGCTGA ATTTTTTACC GACTTCCCTC GCATTCGCCG CATTCTCAAT ACCATGCAAA GCGTTGGGCT TGGCTATCTC AAGCTGGGGC AACCCTCGCC CATGCTTTCA GGCGGCGAAG CACAACGCAT TAAATTATCG GCAGAGTTGG CTAAAATTCA AACAGGCAAA ACGCTCTATA TTTTAGATGA ACCAACCACG GGACTTCATT TTCAGGATAC GCAACATTTG CTGGAAGTGC TCCGCAAATT AGTAGAGAAA GGCAATAGCG TCATTATTAT TGAGCACAAT CTCGATATTA TTAAAAACAG CGACTGGGTT ATTGATTTAG GAGCAGAAGG GGGATTTGAA GGGGGAACAA TTATTGCAGA AGGCACACCT CAGCAAATTG CCGATACGCC TCATTCGCAT ACAGGTAGAT TTTTAAAGAT GGAGATGGGG GGTTAG
|
Protein sequence | MSFSHISIRG ARVHNLKNIS LDIPRNQFVV ITGLSGSGKS SLAFDTIYAE GQRRFMETLS PYARQYIGNI ERPDVDFIEG LSPVIAIDQK STSRSPRSTV GTITEIHDFI RLLYAKAGRR YNPETGAMVQ AQSADNILAT ILALPEGSKV QILSPLVTGR KGHYRELFER LRSKGFLRVR VDGELQEMVP NMQLERYKSH TIELVVDRLV LAPESEARVR EAVMLAISIS EHKSSVICTP FEGGFTELAF TLSKGDNEDA LPTSTLAPNH FSFNSPYGAC PTCNGLGELM QLSGELMIPD PSLSLNQGGL DPFGKAGKRN HWQVIRAIAK EFDFTLDTPM SKIPKSALKI LLNGSGKRTF EVAYTSSGHT SLYPQPFQGA VAYVQEILNN ATTSKVREWA EAYMLHQPCP VCLGARLKPE SLQVKIHGLN IAELEALPLP ETLAFFNNLP PNLSQKELII ATPVLHEITK RLQFLLDVGL GYLSLDRSSH TLSGGEAQRI RLASQLGSQL SGVLYVLDEP SIGLHQRDNH KLITSLKHLR DLGNTVLVVE HDKDTMLEAD TIVDLGPGAG AYGGEIVAFG AARELDPSSL TAGYLNGTNR VFYASEASSE KTDADADATP LFLTLKGCKG NNLKNIDAQI PLRKLVSITG VSGSGKSTLI NETLYPILAR HFYRSKVVTA PFDAIEGIEL LDKVVNVDQS PIGRTPRSNP ATYTGAFTFI RDFFTRLPEA QIRGYKAGRF SFNVKGGRCE VCQGAGTRKI EMNFLPDVYV QCENCKGERY NRETLMVKYR GKSIADVLEM SITEAAEFFT DFPRIRRILN TMQSVGLGYL KLGQPSPMLS GGEAQRIKLS AELAKIQTGK TLYILDEPTT GLHFQDTQHL LEVLRKLVEK GNSVIIIEHN LDIIKNSDWV IDLGAEGGFE GGTIIAEGTP QQIADTPHSH TGRFLKMEMG G
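The relationship between the two sequences above can be illustrated by translating the ends of the gene sequence and comparing them with the ends of the protein sequence. This is a minimal sketch under several assumptions: the spaces every ten characters are display formatting only, the gene sequence is shown in coding orientation (it begins with ATG even though the strand is "-"), and the codon assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which does not matter here). The helper names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing (groups of ten) and normalise case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 18 bases of the gene sequence, copied from the record.
GENE_START = "ATGTCATTCA GCCACATTAG CATACGAGGC"
GENE_END = "AT GGAGATGGGG GGTTAG"

print(translate(GENE_START))  # MSFSHISIRG
print(translate(GENE_END))    # MEMGG*
```

The first ten residues and the last five residues agree with the protein sequence in the record, and the final codon TAG is a stop codon.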
|
| |