Gene Csal_2149 details


Gene Information

Locus tag: Csal_2149
Symbol:
ID: 4026489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2420732
End bp: 2421898
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 65%
IMG OID: 637967354
Product: hypothetical protein
Protein accession: YP_574199
Protein GI: 92114271
COG category: [S] Function unknown
COG ID: [COG1289] Predicted membrane protein
TIGRFAM ID:
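
The start, end, and length fields in this record agree with each other if the coordinates are read as 1-based and inclusive, and if the gene length includes the stop codon. A minimal sketch of that arithmetic follows; both conventions are assumptions, and the function names are illustrative rather than part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(2420732, 2421898)
    print(length, "bp")
    print(protein_length(length), "aa")
```

With the values from this record, the two functions reproduce the reported 1167 bp and 388 aa.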


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.432183
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATGTCGC TTTCACGCCT CCGCGATCCG TATTTCATCT ATCGCTATCG CCATCGTCTG 
CACGTGTTGC GCACATCGCT GGCGCTGGCC ATCACCTATG TCATCATCCT GACCCTCGAG
ATTCCCCACG GCAGCTGGGC GCTGGTCAGC ACGATGATGG TGATGGGCAA CCTGCCGCAT
ATCGGCGGCG TCATCGACAA GGGGGGACAG CGCCTGCTGG GCACGGTACT CGGCGCGATC
TGGGGCGTGT TGCTGGTGCT GATCCCGGCA CCGGCACCCT GGGTGATTCC CGCCTGGACG
CTGATCGGCA TCGCCGTGGC CACGCACACC ACCTTTGCCA CGCGCTATGG CTACAGTGCG
CTGATGTTCG GCGTGACGCT CCTGATGGTC GTCGGCGATG GGCATCAGGA TCTGGGCATC
GCGCTGTGGC GCGCCTTCGA CGTCCTGATC GGCACGCTCG TCGGCATTCT CGCCACGCTC
TTCATCCTGC CGCAGAAAGC CACCGACTTG CTGCGCTTTT TGCTGGCGGA CAACCTCGAC
AAGCTGGCGC GCCTTTATCA TGCCCATACG AGCGCCGCCC AGCAGGAAGA CGTCGATACG
CGCCAGTTGC TCAAGACCAC CTCCACGCAA CTGGTCAAGC AACGTGGCCT GGTGGACGCC
ATCCACAGCG AGCGCCGCCT GCACCGCGAC GACCTGGAAC GCATCCTGTC GCTGGAAAGA
CGCATGCTGT CGACCATCGA GCTGTTGCTG GAGACGCACT GGGCGACCCG AGCCGGCCAC
GACATCATCG CCGGTCTGGA AGGCTTGCGT GACGAGCAGC ACCGCCTGGC CCGCGCCCTG
GGCAGCCTGG CCTTTCAAGT GCGCACCGGG CAGAGCATCG ACCTCACGGT GGTCGCCTTC
GACCTGCAGC GTCACGCGCA AGCGACCTTG AGCGTGCATG CCGACGACGG TCGCGCACTC
TTCAGCCCAA GCGGCTATCT ATGGCTCAAC CGCGAGCTGG CACGCCTCAC CCAGGCGCTG
ATCGACACGC TGCAGACCAT CAACCGCTTG CCCAGTGCCC GCTTGCGTCG ACGCGCCTCG
CGCCAGGCCT TGATCCGCGA CCGCCTGGCC ATGCCGCTGG ATCCGTCCCA GGGGCGGCGC
GACGGCGACA AGGACAAGCC CAACTGA
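
The gene sequence is printed in blocks of ten bases, so whitespace has to be stripped before any computation. The sketch below shows one way a GC fraction such as the one reported in the gene information could be recomputed from the listing. The helper names are hypothetical, and how the database rounds the value is not stated here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First line of the listing above; paste the full sequence for the whole-gene value.
    first_line = "GTGATGTCGC TTTCACGCCT CCGCGATCCG TATTTCATCT ATCGCTATCG CCATCGTCTG"
    print(f"{gc_content(first_line):.1%}")
```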
 
Protein sequence
MMSLSRLRDP YFIYRYRHRL HVLRTSLALA ITYVIILTLE IPHGSWALVS TMMVMGNLPH 
IGGVIDKGGQ RLLGTVLGAI WGVLLVLIPA PAPWVIPAWT LIGIAVATHT TFATRYGYSA
LMFGVTLLMV VGDGHQDLGI ALWRAFDVLI GTLVGILATL FILPQKATDL LRFLLADNLD
KLARLYHAHT SAAQQEDVDT RQLLKTTSTQ LVKQRGLVDA IHSERRLHRD DLERILSLER
RMLSTIELLL ETHWATRAGH DIIAGLEGLR DEQHRLARAL GSLAFQVRTG QSIDLTVVAF
DLQRHAQATL SVHADDGRAL FSPSGYLWLN RELARLTQAL IDTLQTINRL PSARLRRRAS
RQALIRDRLA MPLDPSQGRR DGDKDKPN