Gene Csal_2008 details

Gene Information

Locus tag: Csal_2008
Symbol:
ID: 4027092
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2268048
End bp: 2269691
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 64%
IMG OID: 637967203
Product: hypothetical protein
Protein accession: YP_574058
Protein GI: 92114130
COG category: [R] General function prediction only
COG ID: [COG3008] Paraquat-inducible protein B
TIGRFAM ID:
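
The coordinate and length fields above are related by simple arithmetic. A minimal sketch in Python, assuming 1-based inclusive coordinates and a coding sequence whose final codon is a stop codon; the function names are illustrative and not part of the database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2268048, 2269691)
print(length)                  # 1644
print(protein_length(length))  # 547
```

Both values agree with the Gene Length and Protein Length fields listed for Csal_2008.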


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGATG CTCCCCTGCG CGCGACGCGC CACCATCGCC GGCGGCTTTC GCCGATCTGG 
ATCGTACCGC TCGTGGCGGT TCTCATCGGC GCATGGATGC TCTACGACAA TCTCTCGGCA
CTCGGCCCCA CGATCACCCT GGAGATGAAA AACGCCGAGG GCATCGAGGC CGGCAAGACA
TTGATCAAGA CGCGCAACGT CGAGGTCGGC CGCGTCGAGG ACGTGACACT CTCCGAAGAC
ATGTCCCACA CCATCATCAC GGCCCGCATG AAGCCCGATA CCGAACGCAT GCTGAATGAC
GAGGCGCGCT TCTGGGTCGT CAAGCCGCGT ATCGACCGGG AAGGCATCAG CGGACTGGGC
ACCGTGCTCT CGGGGGCCTA TATCCAGCTG CTGCCCGGCA ACGGCGAAAC CGCCCAGCGT
GAATTCGAGG TACTCGACCA ACCGCCCGTG GCTCCGCCCG ATGCGCCGGG CATTCGCGTC
AACCTGGTCA GCAAGGTGGG CAGTTCCCTG CGCGCCGGCG ACCCCATCAC CTATCAGGGA
TTCACCGTCG GGCGCGTGGA GAACACCGAG TTCGATCCCG AAGAAAAGGA AATGCGCCAT
CGCCTTTACA TCCAGTCGCC TTATGACGTT CTGGTCACGG ACACCACACG CTTCTGGATC
TCGTCGGGTG TCGACGTGCG CCTCGACTCC CAGGGCTTCC GGGTCAACGT GGAATCCATG
GAATCGCTCA TCGGCGGCGG CGTGACCTTC GGCGTGCCGG AGGATGTCGC CATGGGGCAT
CCCGCCGAAT CCGAGGCGAC CTACCAACTC TTCAGCGACG AGGAAAGCGC GCGTGAAGGC
ACCTTCGACC GCTACCTCGA ATACGTGCTG CTGGTCGACG ACACCGTACG CGGCTTGAGC
CGAGGCGACT CAGTCGAGTA CCGCGGCGTG CGCGTGGGGA CCGTGGAAGC CGTGCCCTGG
CGCTTCTCCG CTCCCCAGCC GGATACGCTG AACCGTTTCG CGATTCCGGT ACTGATTCGC
ATCGAGCCAC AGCGCTTCGA CGACGCCATG GCGAATTTTG ACGCCGAGGA CTGGCGTGCC
CGCCTCGAAC GCATGTTCGA GCACGGCCTG CGCGCCACGC TCAAGGCCGG CAATCTGCTC
ACCGGCGCGC TGTTCGTCGA CCTCAACTTC CGCGACGACC CCGAGCCCTA CGAAGCGCTC
ACCTTCGAGG GCAAGACGGT GTTCCCGACG ACGTCGGGCG GCTTCGCGCA AATCGAGCAG
AAAGTCTCCA ACCTGCTCGA CAAGCTCAAC GAGCTCGAGG TCGAGCCGAT CCTCACCTCG
CTGAACGATA CCTTGACGAC GACACGCGCC ACGATGCGCA AGGTCAACGA CATCGCCAAC
TCCGTGGATA CCTTGCTGAA CGACCCGGCG ACCCGCGAGC TGCCGCAGAA CCTCAACGAG
ACGCTGCGCC AGACCCGCGA TACGCTGCAG GGCTTCTCGC CCGACTCGCA GGGCTACCGT
GAACTCAACG ACACACTCTC GCGGCTCGAG TCGCTGATGC GCGACCTGCA GCCCGTGGTG
CGCACGCTCA GCGAGAAACC CAACGCCTTG ATCTTCGACC GTGAAGAGAC ACGAGATCCA
TTACCGAGGG CCCCAAGCCA ATGA
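
The gene sequence is printed in blocks of 10 bases, so it has to be joined before its length or GC content can be checked. A minimal sketch in Python, assuming the sequence has been copied into a string; the helper names are hypothetical:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here as a short example.
first_line = "ATGCCTGATG CTCCCCTGCG CGCGACGCGC CACCATCGCC GGCGGCTTTC GCCGATCTGG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applied to the full sequence, the results can be compared with the Gene Length and GC content fields of the record.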
 
Protein sequence
MPDAPLRATR HHRRRLSPIW IVPLVAVLIG AWMLYDNLSA LGPTITLEMK NAEGIEAGKT 
LIKTRNVEVG RVEDVTLSED MSHTIITARM KPDTERMLND EARFWVVKPR IDREGISGLG
TVLSGAYIQL LPGNGETAQR EFEVLDQPPV APPDAPGIRV NLVSKVGSSL RAGDPITYQG
FTVGRVENTE FDPEEKEMRH RLYIQSPYDV LVTDTTRFWI SSGVDVRLDS QGFRVNVESM
ESLIGGGVTF GVPEDVAMGH PAESEATYQL FSDEESAREG TFDRYLEYVL LVDDTVRGLS
RGDSVEYRGV RVGTVEAVPW RFSAPQPDTL NRFAIPVLIR IEPQRFDDAM ANFDAEDWRA
RLERMFEHGL RATLKAGNLL TGALFVDLNF RDDPEPYEAL TFEGKTVFPT TSGGFAQIEQ
KVSNLLDKLN ELEVEPILTS LNDTLTTTRA TMRKVNDIAN SVDTLLNDPA TRELPQNLNE
TLRQTRDTLQ GFSPDSQGYR ELNDTLSRLE SLMRDLQPVV RTLSEKPNAL IFDREETRDP
LPRAPSQ
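
The protein sequence can be checked against the gene sequence by translating it codon by codon. A minimal sketch in Python, assuming the amino acid assignments of NCBI translation table 11 and a reading frame that starts at the first base. It does not handle the alternative start codons that table 11 allows, which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative:

```python
BASES = "TCAG"
# Amino acids for translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every codon (TTT, TTC, ..., GGG) to its one-letter amino acid.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence.
print(translate("ATGCCTGATG CTCCCCTGCG CGCGACGCGC"))  # MPDAPLRATR
print(translate("TTACCGAGGG CCCCAAGCCA ATGA"))        # LPRAPSQ
```

The two outputs match the first ten and last seven residues of the protein sequence.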