Gene Csal_0448 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0448 
Symbol 
ID4027022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp493135 
End bp496008 
Gene Length2874 bp 
Protein Length957 aa 
Translation table11 
GC content64% 
IMG OID637965606 
Productexcinuclease ABC subunit A 
Protein accessionYP_572509 
Protein GI92112581 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.775809 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAGAA TTCTGGTCAG GGGTGCGCGC ACCCACAACT TGCAGAACAT CGACGTCGAA 
CTTCCGCGTA ACAAGCTGAT CGTCGTGACC GGCTTGTCCG GCTCCGGCAA ATCGTCGTTG
GCCTTCGACA CCCTCTACGC CGAGGGACAA CGGCGCTACG TGGAATCGCT CTCCACTTAC
GCCCGCCAAT TCCTGTCGAT GATGGAAAAG CCCGATGTCG ACCACATCGA AGGGCTGTCG
CCGGCGATTT CCATCGAACA GAAGTCGACC TCGCACAACC CGCGCTCGAC CGTGGGCACG
ATCACCGAGA TCTACGACTA TCTTCGCCTG CTCTACGCAC GCGCCGGGAC GCCGCGCTGT
CCCGAGCACG GCATCGACCT CGAGGCACAG ACCGTCTCGC AAATGGTCGA CCAGGTGCTG
GCGCTGCCCG AGGGCAGCAA GCTGATGTTG CTGGCACCGG TGGTCAGCGG GCGCAAGGGC
GAGCACCAGC AACTGCTGAC CGAGCTGCGC GCTCAGGGTT ATGTCCGCGC CATGGTCGAT
GGCCAGGCCG TCGAGCTCGA CGAGCTCGGA CCGCTCGAGA AGAACAAGAA GCACGACATC
AGCGTGGTGG TCGACCGCGT CAAGGTCCGC GACGGACTCG CTCAGCGTCT TGCCGAGTCC
TTCGAGACCG CGCTCGGTCT GGCCGACGGT ATCGCCGTCG TCCATGACAT GGACGGCGAA
CGCGACGACA TCGTGTTCTC GGCACGCTTC GCCTGCCCGG TATGCGGCTA TTCAATCGCC
GAGCTCGAGC CGCGGCTGTT CTCGTTCAAC AATCCCGCCG GCGCCTGCCC GACCTGCGAC
GGCCTCGGCG TGCGTCAGTA CTTCGATCCC GACAAGCTGA TCAGCCATCC CGAACTGTCT
CTGGCGGAAG GTGCCATCAA GGGCTGGGAC CGGCGCAGCG TGTATTACTT CAACCAGCTG
CAGGCAGTCG CCGAGCATTA TCGCTTCAAG CTCGAGACCC CATGGCAGGA TCTCGCGCGG
CACGAGCGCG AGGTGGTGCT GCGCGGCAGC GGTCATGACC AGATCACCTT CAGCTACGTC
AACGACCGGG GCCGCAAGGT CACCCGCGAG CATGCTTTCG AGGGTGTCCT GCCCAACATG
GAACGGCGCT ATCGCGAAAC CGAATCGAGC ATGGTGCGCG AAGACCTGGC CAAGTACCTG
GCCGTGCAGC CCTGTCCCTC CTGCGAGGGG ACTCGGTTGC GCAAGGAGGC GCGCCATGTC
TATGTCGACG GCCGCCCGCT GCCCGAGGTG GCACACCTGC CGATCGGCGA AGCCTGGCAG
TATTTCGCGA CACTCGAGCT GTCGGGACGC AAGGGCGAGA TCGCCGTCAA GATCGTGCAC
GAGATCCACT CACGGCTGGA ATTCCTGGTC AATGTGGGGC TCGACTACCT GACCCTGGAA
CGCAGTGCCG ACACCCTGTC GGGGGGCGAA GCCCAGCGCA TTCGCCTGGC CAGTCAGATC
GGTGCGGGCC TGGTCGGGGT CATGTACATT CTCGACGAGC CGTCCATCGG TCTTCATCAG
CGCGATAACG ACCGCCTGCT CAAGACGCTC GAACGCCTGC GCGACCTGGG CAACACGGTC
ATCGTCGTCG AGCACGACGA AGACGCCATC CGTGCCGCCG ACCATGTGCT CGACATCGGG
CCCGGGGCCG GCGTGCACGG AGGACGCATC GTCGCACAGG GCACGCCCCA GCAGATCGAG
CAGAGCGAGG AATCGCTGAC AGGCCAGTAC CTCTCCGACC AGCGGCGCAT CGAGATCCCG
CCCCATCGGA TCCCGGGCAA TCCGGAAAAG ATGCTGCGAC TCACCGGCGC CAACGGCAAC
AACCTGCAGA ACGTGACGCT CGAACTGCCC CTGGGGCTGT TCGTCTGCGT CACCGGCGTG
TCGGGCTCGG GCAAGTCGAC ACTGATCAAC TCCACGTTGA TGCCGGTAGC CACCCGCGAA
CTCAATCGCG CCACGTCACT GACACCCGCC CCCTACCGGC ACATCGAGGG GCTGGATCTA
CTCGACAAGG TCATCGACAT CGACCAGAGT CCGATCGGGC GCACGCCGCG GTCCAACCCG
GCCACCTATA CCGGCATCTT CACGCCCATA CGCGAGCTCT TCGCTGGCAC CCAGGAAGCA
CGCTCGCGCG GCTACAAGCC GGGGCGTTTC AGCTTCAACG TCAAGGGCGG GCGCTGCGAA
GCCTGCCAGG GCGAGGGCAT GATCAAGGTC GAGATGCACT TCCTGCCCGA CATCTACGTG
CCCTGCGACG TGTGCCAGGG AAAACGCTAC AACCGCGAGA CGCTGGAGAT CCACTACAAG
GGCAAGAACA TCCACGAGGT ACTCGAGATG ACCATCGAGG AGGCCCTGGA ATTCTTCAGC
CCGGTCCCGG CGATCGCCCG GCGCCTGCAG ACGTTGATGG ATGTGGGCTT GTCCTACGTT
CGCCTTGGGC AGAGCGCGAC CACGCTTTCC GGCGGCGAAG CGCAGCGCGT CAAGCTCGCC
CGCGAGCTCG CCAAGCGCGA CACCGGCAAG ACGCTCTACA TTCTCGACGA GCCCACTACC
GGGCTGCATT TCGAGGATAT TCGGCAACTG CTGGGGGTCC TGCACCGCCT GCGCGACCAT
GGCAATACCG TCGTCGTCAT CGAACACAAC CTCGATGTCA TCAAGACGGC GGACTGGTTG
ATCGACCTTG GCCCCGAGGG AGGCTCAGGC GGTGGCCGCA TCATTGCCGA GGGCACTCCG
GAACAGGTGG CCAAGCTCGA GCAGTCCCAC ACCGGGCGCT TTCTCAAGCC CTTGCTGGAC
AAGCGCACGG CGCCAGCGAA CAAGGCACGC ACCCCAAGCA CGCTCGAGAC CTGA
 
Protein sequence
MDRILVRGAR THNLQNIDVE LPRNKLIVVT GLSGSGKSSL AFDTLYAEGQ RRYVESLSTY 
ARQFLSMMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIYDYLRL LYARAGTPRC
PEHGIDLEAQ TVSQMVDQVL ALPEGSKLML LAPVVSGRKG EHQQLLTELR AQGYVRAMVD
GQAVELDELG PLEKNKKHDI SVVVDRVKVR DGLAQRLAES FETALGLADG IAVVHDMDGE
RDDIVFSARF ACPVCGYSIA ELEPRLFSFN NPAGACPTCD GLGVRQYFDP DKLISHPELS
LAEGAIKGWD RRSVYYFNQL QAVAEHYRFK LETPWQDLAR HEREVVLRGS GHDQITFSYV
NDRGRKVTRE HAFEGVLPNM ERRYRETESS MVREDLAKYL AVQPCPSCEG TRLRKEARHV
YVDGRPLPEV AHLPIGEAWQ YFATLELSGR KGEIAVKIVH EIHSRLEFLV NVGLDYLTLE
RSADTLSGGE AQRIRLASQI GAGLVGVMYI LDEPSIGLHQ RDNDRLLKTL ERLRDLGNTV
IVVEHDEDAI RAADHVLDIG PGAGVHGGRI VAQGTPQQIE QSEESLTGQY LSDQRRIEIP
PHRIPGNPEK MLRLTGANGN NLQNVTLELP LGLFVCVTGV SGSGKSTLIN STLMPVATRE
LNRATSLTPA PYRHIEGLDL LDKVIDIDQS PIGRTPRSNP ATYTGIFTPI RELFAGTQEA
RSRGYKPGRF SFNVKGGRCE ACQGEGMIKV EMHFLPDIYV PCDVCQGKRY NRETLEIHYK
GKNIHEVLEM TIEEALEFFS PVPAIARRLQ TLMDVGLSYV RLGQSATTLS GGEAQRVKLA
RELAKRDTGK TLYILDEPTT GLHFEDIRQL LGVLHRLRDH GNTVVVIEHN LDVIKTADWL
IDLGPEGGSG GGRIIAEGTP EQVAKLEQSH TGRFLKPLLD KRTAPANKAR TPSTLET