Gene Csal_3024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_3024 
SymbolsbcB 
ID4028990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp3367620 
End bp3369107 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content66% 
IMG OID637968230 
Productexonuclease I 
Protein accessionYP_575067 
Protein GI92115139 
COG category[L] Replication, recombination and repair 
COG ID[COG2925] Exonuclease I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.377058 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAGG GCAAGACCGA GCCGACCTTT CTTTGGCATG ATTACGAGAC CTTCGGTGCC 
GATCCTCGGC GCGACCGCCC CGCGCAGTTC GCCGCGATCC GCACCGACAG CGACTTCAAT
CCCGTCGGCG AACCAATGAC GTTCTATTGC AAGCCGGCGG ACGATTTCCT GCCTCATCCC
CAGGCCTGTC TGATCACGGG CATCACGCCC CAGGCGGCTC GCCGTCGTGG CCTGGCCGAG
ATCGAGTTCG CCGCGCGCAT CCACGCCGAG ATGAGTGTCC CCGGCACCTG CTCGCTGGGC
TATAACACGC TGCGTTTCGA CGACGAAGTC AGCCGTCATC TCTTCTACCG CAACTTCATC
GATCCGTACT CGCGCGAATG GCAGAACGGC AACTCGCGAT GGGACCTGAT CGATGTGGTG
CGTACCTATC ATGCCCTGCG TCCCGACGGC ATCGAGTGGC CGACGCGCGA AGACGGCGCG
CCGAGCTTTC GTCTCGAGGA CTTGACCGCC GCCAACGGCA TCGAGCATGC CGGTGCCCAC
GATGCATTGG CCGACGTCCG TGCGACCATC GAGCTGGCGC GTCTGCTCAA GGCCCGCAAC
GCCCAGTTGT TCGAGTACCT GCTCAAGCTG CGCGACAAGC GCACGGTGGC CAGGATGCTG
GACATCCAGG CGCGCAAGCC GATGCTGCAC GTGTCGCGGC GTTACCCCGC CAGTCGTGGC
TGCAGTGCCC TGGTGGTGCC GCTCGCCGAG CATCCCAGCA ATCCCAACGG GGTGATCGTC
TATGACCTGT CGGTGGATCC CTCACCGCTG CTGTCGATGG AGGCGGCGCA GATCCGCGAG
CGGGTGTTCG CCAGTGCCGA TGAGCTGGCC GAGGGAGAGG CGCGCATTCC TCTCAAGGTG
ATTCACATCA ACAAGAGTCC GGTGATCATG CCGGCGGCCT CGCTCAAGGA CGTCGATGGT
CCGCAGCGGG GCGAATACGG TGCCCTCGTG GCCCGGCTGG GCCTGGACAT GGCAGCCTGC
CGCCGCCACT GGAAGCAGCT CCAGGCGGCG CCGGACGTGG CGGCCAAGGC TGCCGAGGTA
TTCGCCGAGG GCCCCGGAGC ACCACCCAGC GACCCGGATC TGATGCTGTA TTCGGGGGGC
TTTTTCTCGC CCGCGGACCG GCGCGAGATG CAGCGCGCAC GCGAGATGCC GGCCTGGGAC
CTGGCCGAGG CGAGCTTCGC CTTTCAGGAT CCGCGTCTGG AGGAAATGCT GTTTCGTCTG
CGCGCGCGCA ACTATCCCGA CACCCTGAGC AGCGAGGAGC AGGCGCAGTG GGAGGCCTAT
CGCTGGGCGC GCATGAACGA CGCCGAGGTG GCCAGCCTGA CGTTGACGGG GTTCGCGCGC
GAGATCGAAC GTCTCAACCA GGTGCCGCTG GACGACGCGC AGCGCCAGGT GCTCGAAGAG
CTGGTCATGC ACGTCGAGGC GATGATGCCG CCGCAGGCGT TCGGCTGA
 
Protein sequence
MAQGKTEPTF LWHDYETFGA DPRRDRPAQF AAIRTDSDFN PVGEPMTFYC KPADDFLPHP 
QACLITGITP QAARRRGLAE IEFAARIHAE MSVPGTCSLG YNTLRFDDEV SRHLFYRNFI
DPYSREWQNG NSRWDLIDVV RTYHALRPDG IEWPTREDGA PSFRLEDLTA ANGIEHAGAH
DALADVRATI ELARLLKARN AQLFEYLLKL RDKRTVARML DIQARKPMLH VSRRYPASRG
CSALVVPLAE HPSNPNGVIV YDLSVDPSPL LSMEAAQIRE RVFASADELA EGEARIPLKV
IHINKSPVIM PAASLKDVDG PQRGEYGALV ARLGLDMAAC RRHWKQLQAA PDVAAKAAEV
FAEGPGAPPS DPDLMLYSGG FFSPADRREM QRAREMPAWD LAEASFAFQD PRLEEMLFRL
RARNYPDTLS SEEQAQWEAY RWARMNDAEV ASLTLTGFAR EIERLNQVPL DDAQRQVLEE
LVMHVEAMMP PQAFG