Gene Csal_1702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1702 
Symbol 
ID4028540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1934686 
End bp1935666 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content59% 
IMG OID637966890 
Producthypothetical protein 
Protein accessionYP_573753 
Protein GI92113825 
COG category[S] Function unknown 
COG ID[COG5464] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01784] conserved hypothetical protein (putative transposase or invertase) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.412934 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAGTC ACCCCCACCA CCCTCACGAC CACAGCTATA AGTTGCTGTT CTCACACCCT 
GAGATGGTGA GAGATCTATT GACCGGGTTC GTCAAGGAAG CCTGGGTGGA ACAACTCGAC
TTCTCGACAC TGGAGAAGGT CAGCGGCTCC TATATCACCG AAGATCTGCG AGATCGCGAG
GACGACGTCA TCTGGCGAGT GCGTTGGGGC GACGACTGGC TCTATGTTTA TTTGCTGCTC
GAGTTTCAGT CGAGCGTCGA TAGATTCATG GCCGTGCGAG TCATGACCTA CCTGGGGCTG
CTCTATCAGG ACTTGATTCG TCAGGAGGCC TTTACCCCCA ATGGCAAGCT ACCGCCAGTG
TTGCCGATTG TGCTCTACAA TGGGGAGAAG CGCTGGACGG CGGCACAAAA CGTGGCGGAC
CTGGTCGAAC AGGTACCCGG AGGGCTCGAA CGTTATCGGC CGAACTTGGC CTACCTGCTT
CTCGACGAAG GAGCGGTCAT CAGCGATCCT GAGTGGTCGG ATCACATGCG CAACGTGGCT
GCTGCGCTCT TTCGATTGGA GCACAATCGC GACGAGCAAG ACATGCTGGA GGTGCTGGGC
ACGCTGGTCG AGTGGCTCAA GGCGCCCGAG CAAACCGGGC TACGACGGGC CTTCGTGGTG
TGGATACGCC GCGTACTGCT GCCCAACCGG GCGCCGGGGA TGGAACTGCC CGAGTTCAAC
GAGTTGCAGG ATCTACACGA GGTACACGAC ATGCTGGCAG AACGCATCAA GCAATGGCCT
GAACGGTGGG AAGAGAAAGG CCGTCAGGAA GGCCGTCAAG AAGGGCGTAA AGAAGGGCGT
CAGGAAGGCG AACAACGGGG CATCGAGAAG ACCGCCCGCA ACCTGATCAA GCTGGGTGTA
CTCAGTGATG AACAGATCGC CGAGGCCACG GGGCTGACGG TGGCCGAGGT GGAAGGGCTG
CGCGAAGAAG ACACGCAGTG A
 
Protein sequence
MASHPHHPHD HSYKLLFSHP EMVRDLLTGF VKEAWVEQLD FSTLEKVSGS YITEDLRDRE 
DDVIWRVRWG DDWLYVYLLL EFQSSVDRFM AVRVMTYLGL LYQDLIRQEA FTPNGKLPPV
LPIVLYNGEK RWTAAQNVAD LVEQVPGGLE RYRPNLAYLL LDEGAVISDP EWSDHMRNVA
AALFRLEHNR DEQDMLEVLG TLVEWLKAPE QTGLRRAFVV WIRRVLLPNR APGMELPEFN
ELQDLHEVHD MLAERIKQWP ERWEEKGRQE GRQEGRKEGR QEGEQRGIEK TARNLIKLGV
LSDEQIAEAT GLTVAEVEGL REEDTQ