Gene Csal_0837 details


Gene Information

Locus tag: Csal_0837
Symbol:
ID: 4027400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 934197
End bp: 935903
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 63%
IMG OID: 637966003
Product: hypothetical protein
Protein accession: YP_572893
Protein GI: 92112965
COG category:
COG ID:
TIGRFAM ID:
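
The Start bp, End bp, Gene Length, and Protein Length fields are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; neither is stated in the record, although both are consistent with the reported values. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon is not translated


length = gene_length_bp(934197, 935903)
print(length, protein_length_aa(length))  # prints: 1707 568
```

Both results agree with the Gene Length and Protein Length values listed in the record.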


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTACGAC GTATCTCACT TGCCTGCGTG ATGGCCGGCA GCCTGCTTCT GGCAGGTTGC 
GGACAAGATT CGGCAGACGC ACCACTGGCA CATGTTCCGG CTGACACCCC CTTTCTCTTC
GCCAACCTCG AAACCATCGA CGACGCGACT CTCGACGCCG CGCTGGCGTC ATCAAACGCT
TCACTCGCGC AACAGCGTAT CGACCTGCGC CAATTCGCAG AAGAACTGCG CCGCGACGGC
GAGGCCGAGT CCCTCGCCAA TGTGCTTGAC GCACTCGCCG AGGAGCTTGC CGGCAAATCG
GACTATCAGC AGGTCGCCGA ACAGATCGGC GTCGACCTGG GCGGTCAGAA CGCTCTCTAC
GGTCTGGGCC TGAGCCCGGT ATTGCGCCTG AGCATCAACG ATGCTGAGCG CTACCAGGCT
TTCCTGCAAC GTCTTGCAGA CGCTGCAGGC CTGCCGCTGG AAACCCGCAC GCAGGGCGAG
CTGGAATACC GCCAGGCACG CCTCGGCGAG GCGCCGCTGC AACTGCTGAG TACCGTTCAT
GACGGCCAGG CGGTGCTGGC AGTCGCCCCC ACCGAACTGG ATGACGACGC ACTGCAGCAA
GTGTTAGGCA CGAGCCTGCC CGACAGCAGC GTGCAGGATA CCCAGCGCCT CAGTGAGCTG
GCCGACGCCA AGGACTACCT GCCCTACGGC CTGGGCTATG TCGATACGAC ACGCTTGGCG
ACCTTGCTCA CCGGCAGTCA GGATCTCATG ATTCAGGCCT TTCGCGCATT CGCCGAGCAG
ACGCAAGGTC AGGCCCCAGA ACCGGTTTCG CAGAGCTGCC GCGAGGATGC GACGCGCCTA
GCCTCGCGCA TGCCGCGACT GAGCGCCGGC TACACCACGC TCGACGCCGC ACGCACCGAG
CAACGCTTCG ATGTGTCATT GGCCGAAGAT ATCACCGCCC CGCTTGCCTC GCTCACTTCA
ACACTGCCGG GCCTGGGCAA TGACTCGCTT GAGTCTCCCT TCGACCTTGC GATCGCACTG
CCCATGAACG ACCTGCGCGA CCTGCTGACC CAACAGATCC AACACGTGCG TACCGCACCG
TTCAGTTGCT CGGCGCTCGC CGAACTCAAC AACGATCTGG ACGAACTCGG CCGTCAGGCC
AACATGCTGG CCATGCCCCC GTTCGGTAGC CTGCGCGGCA TGCGGCTGGT AATCGATGAG
CTCACGATGC CCAAGCATAG CGATCAGCCC GCCATCAAGG GCGCTCTGCT GGTAGCCTCC
AGCGACCCCA ACGGGTTGAT GGCGATCGGC CAGAGCATGC TGCCCGGGCT CGCGACACTC
TCCCTTTCCA ACGACGGCGA ACCCCAAGCT CTGCCGCCAC AGCTCACCGC GATGCTAGGC
GATGCACCGG CCTGGCTGGC CATGACAGAC AAGGCACTCG GCGTGGCAAC GGGTGAGGGC
GAGCAGACGA CGCTCAAGTC CTTGCTTCAG GAAGAAACCG GCGAGGCCGG CGAACTGATG
CACGTCAAGC TTTCCGGCGA CATGTACGCC AAGTGGCTTC AGCTTGCCGA CGCCTTCGGC
AACCTCGCAG GCAACGACGC TGCAGCACTC GAAGAGCAGC TCGATGCCAT GCAGAACCAA
TTCGAGCGCA TCGACAACGT CGTAATACGC ATGCGTATGG AAGACGACGG CCTGGTCATC
AACAACCGTA TCGACTGGCA ACAGTAA
 
Protein sequence
MLRRISLACV MAGSLLLAGC GQDSADAPLA HVPADTPFLF ANLETIDDAT LDAALASSNA 
SLAQQRIDLR QFAEELRRDG EAESLANVLD ALAEELAGKS DYQQVAEQIG VDLGGQNALY
GLGLSPVLRL SINDAERYQA FLQRLADAAG LPLETRTQGE LEYRQARLGE APLQLLSTVH
DGQAVLAVAP TELDDDALQQ VLGTSLPDSS VQDTQRLSEL ADAKDYLPYG LGYVDTTRLA
TLLTGSQDLM IQAFRAFAEQ TQGQAPEPVS QSCREDATRL ASRMPRLSAG YTTLDAARTE
QRFDVSLAED ITAPLASLTS TLPGLGNDSL ESPFDLAIAL PMNDLRDLLT QQIQHVRTAP
FSCSALAELN NDLDELGRQA NMLAMPPFGS LRGMRLVIDE LTMPKHSDQP AIKGALLVAS
SDPNGLMAIG QSMLPGLATL SLSNDGEPQA LPPQLTAMLG DAPAWLAMTD KALGVATGEG
EQTTLKSLLQ EETGEAGELM HVKLSGDMYA KWLQLADAFG NLAGNDAAAL EEQLDAMQNQ
FERIDNVVIR MRMEDDGLVI NNRIDWQQ
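
Both sequences are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the nucleotide sequence. The helper names are hypothetical, and the codon table is the standard codon-to-amino-acid assignment, which is assumed here to apply under translation table 11. Only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied with the display spacing.
prefix = clean("ATGTTACGAC GTATCTCACT TGCCTGCGTG")
print(translate(prefix))    # prints: MLRRISLACV
print(gc_fraction(prefix))  # prints: 0.5
```

The translated prefix matches the first ten residues of the protein sequence. Running the same helpers on the full gene sequence is one way to compare against the GC content and protein length reported in the record.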