Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0448 |
Symbol | |
ID | 4027022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 493135 |
End bp | 496008 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637965606 |
Product | excinuclease ABC subunit A |
Protein accession | YP_572509 |
Protein GI | 92112581 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.775809 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAGAA TTCTGGTCAG GGGTGCGCGC ACCCACAACT TGCAGAACAT CGACGTCGAA CTTCCGCGTA ACAAGCTGAT CGTCGTGACC GGCTTGTCCG GCTCCGGCAA ATCGTCGTTG GCCTTCGACA CCCTCTACGC CGAGGGACAA CGGCGCTACG TGGAATCGCT CTCCACTTAC GCCCGCCAAT TCCTGTCGAT GATGGAAAAG CCCGATGTCG ACCACATCGA AGGGCTGTCG CCGGCGATTT CCATCGAACA GAAGTCGACC TCGCACAACC CGCGCTCGAC CGTGGGCACG ATCACCGAGA TCTACGACTA TCTTCGCCTG CTCTACGCAC GCGCCGGGAC GCCGCGCTGT CCCGAGCACG GCATCGACCT CGAGGCACAG ACCGTCTCGC AAATGGTCGA CCAGGTGCTG GCGCTGCCCG AGGGCAGCAA GCTGATGTTG CTGGCACCGG TGGTCAGCGG GCGCAAGGGC GAGCACCAGC AACTGCTGAC CGAGCTGCGC GCTCAGGGTT ATGTCCGCGC CATGGTCGAT GGCCAGGCCG TCGAGCTCGA CGAGCTCGGA CCGCTCGAGA AGAACAAGAA GCACGACATC AGCGTGGTGG TCGACCGCGT CAAGGTCCGC GACGGACTCG CTCAGCGTCT TGCCGAGTCC TTCGAGACCG CGCTCGGTCT GGCCGACGGT ATCGCCGTCG TCCATGACAT GGACGGCGAA CGCGACGACA TCGTGTTCTC GGCACGCTTC GCCTGCCCGG TATGCGGCTA TTCAATCGCC GAGCTCGAGC CGCGGCTGTT CTCGTTCAAC AATCCCGCCG GCGCCTGCCC GACCTGCGAC GGCCTCGGCG TGCGTCAGTA CTTCGATCCC GACAAGCTGA TCAGCCATCC CGAACTGTCT CTGGCGGAAG GTGCCATCAA GGGCTGGGAC CGGCGCAGCG TGTATTACTT CAACCAGCTG CAGGCAGTCG CCGAGCATTA TCGCTTCAAG CTCGAGACCC CATGGCAGGA TCTCGCGCGG CACGAGCGCG AGGTGGTGCT GCGCGGCAGC GGTCATGACC AGATCACCTT CAGCTACGTC AACGACCGGG GCCGCAAGGT CACCCGCGAG CATGCTTTCG AGGGTGTCCT GCCCAACATG GAACGGCGCT ATCGCGAAAC CGAATCGAGC ATGGTGCGCG AAGACCTGGC CAAGTACCTG GCCGTGCAGC CCTGTCCCTC CTGCGAGGGG ACTCGGTTGC GCAAGGAGGC GCGCCATGTC TATGTCGACG GCCGCCCGCT GCCCGAGGTG GCACACCTGC CGATCGGCGA AGCCTGGCAG TATTTCGCGA CACTCGAGCT GTCGGGACGC AAGGGCGAGA TCGCCGTCAA GATCGTGCAC GAGATCCACT CACGGCTGGA ATTCCTGGTC AATGTGGGGC TCGACTACCT GACCCTGGAA CGCAGTGCCG ACACCCTGTC GGGGGGCGAA GCCCAGCGCA TTCGCCTGGC CAGTCAGATC GGTGCGGGCC TGGTCGGGGT CATGTACATT CTCGACGAGC CGTCCATCGG TCTTCATCAG CGCGATAACG ACCGCCTGCT CAAGACGCTC GAACGCCTGC GCGACCTGGG CAACACGGTC ATCGTCGTCG AGCACGACGA AGACGCCATC CGTGCCGCCG ACCATGTGCT CGACATCGGG CCCGGGGCCG GCGTGCACGG AGGACGCATC GTCGCACAGG GCACGCCCCA GCAGATCGAG CAGAGCGAGG AATCGCTGAC AGGCCAGTAC CTCTCCGACC AGCGGCGCAT CGAGATCCCG CCCCATCGGA TCCCGGGCAA TCCGGAAAAG ATGCTGCGAC TCACCGGCGC CAACGGCAAC AACCTGCAGA ACGTGACGCT CGAACTGCCC CTGGGGCTGT TCGTCTGCGT CACCGGCGTG TCGGGCTCGG GCAAGTCGAC ACTGATCAAC TCCACGTTGA TGCCGGTAGC CACCCGCGAA CTCAATCGCG CCACGTCACT GACACCCGCC CCCTACCGGC ACATCGAGGG GCTGGATCTA CTCGACAAGG TCATCGACAT CGACCAGAGT CCGATCGGGC GCACGCCGCG GTCCAACCCG GCCACCTATA CCGGCATCTT CACGCCCATA CGCGAGCTCT TCGCTGGCAC CCAGGAAGCA CGCTCGCGCG GCTACAAGCC GGGGCGTTTC AGCTTCAACG TCAAGGGCGG GCGCTGCGAA GCCTGCCAGG GCGAGGGCAT GATCAAGGTC GAGATGCACT TCCTGCCCGA CATCTACGTG CCCTGCGACG TGTGCCAGGG AAAACGCTAC AACCGCGAGA CGCTGGAGAT CCACTACAAG GGCAAGAACA TCCACGAGGT ACTCGAGATG ACCATCGAGG AGGCCCTGGA ATTCTTCAGC CCGGTCCCGG CGATCGCCCG GCGCCTGCAG ACGTTGATGG ATGTGGGCTT GTCCTACGTT CGCCTTGGGC AGAGCGCGAC CACGCTTTCC GGCGGCGAAG CGCAGCGCGT CAAGCTCGCC CGCGAGCTCG CCAAGCGCGA CACCGGCAAG ACGCTCTACA TTCTCGACGA GCCCACTACC GGGCTGCATT TCGAGGATAT TCGGCAACTG CTGGGGGTCC TGCACCGCCT GCGCGACCAT GGCAATACCG TCGTCGTCAT CGAACACAAC CTCGATGTCA TCAAGACGGC GGACTGGTTG ATCGACCTTG GCCCCGAGGG AGGCTCAGGC GGTGGCCGCA TCATTGCCGA GGGCACTCCG GAACAGGTGG CCAAGCTCGA GCAGTCCCAC ACCGGGCGCT TTCTCAAGCC CTTGCTGGAC AAGCGCACGG CGCCAGCGAA CAAGGCACGC ACCCCAAGCA CGCTCGAGAC CTGA
|
Protein sequence | MDRILVRGAR THNLQNIDVE LPRNKLIVVT GLSGSGKSSL AFDTLYAEGQ RRYVESLSTY ARQFLSMMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIYDYLRL LYARAGTPRC PEHGIDLEAQ TVSQMVDQVL ALPEGSKLML LAPVVSGRKG EHQQLLTELR AQGYVRAMVD GQAVELDELG PLEKNKKHDI SVVVDRVKVR DGLAQRLAES FETALGLADG IAVVHDMDGE RDDIVFSARF ACPVCGYSIA ELEPRLFSFN NPAGACPTCD GLGVRQYFDP DKLISHPELS LAEGAIKGWD RRSVYYFNQL QAVAEHYRFK LETPWQDLAR HEREVVLRGS GHDQITFSYV NDRGRKVTRE HAFEGVLPNM ERRYRETESS MVREDLAKYL AVQPCPSCEG TRLRKEARHV YVDGRPLPEV AHLPIGEAWQ YFATLELSGR KGEIAVKIVH EIHSRLEFLV NVGLDYLTLE RSADTLSGGE AQRIRLASQI GAGLVGVMYI LDEPSIGLHQ RDNDRLLKTL ERLRDLGNTV IVVEHDEDAI RAADHVLDIG PGAGVHGGRI VAQGTPQQIE QSEESLTGQY LSDQRRIEIP PHRIPGNPEK MLRLTGANGN NLQNVTLELP LGLFVCVTGV SGSGKSTLIN STLMPVATRE LNRATSLTPA PYRHIEGLDL LDKVIDIDQS PIGRTPRSNP ATYTGIFTPI RELFAGTQEA RSRGYKPGRF SFNVKGGRCE ACQGEGMIKV EMHFLPDIYV PCDVCQGKRY NRETLEIHYK GKNIHEVLEM TIEEALEFFS PVPAIARRLQ TLMDVGLSYV RLGQSATTLS GGEAQRVKLA RELAKRDTGK TLYILDEPTT GLHFEDIRQL LGVLHRLRDH GNTVVVIEHN LDVIKTADWL IDLGPEGGSG GGRIIAEGTP EQVAKLEQSH TGRFLKPLLD KRTAPANKAR TPSTLET
|
| |