Gene Csal_0220 details

Gene Information

Locus tag: Csal_0220
Symbol:
ID: 4027303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 246764
End bp: 247765
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 66%
IMG OID: 637965371
Product: periplasmic solute binding protein
Protein accession: YP_572283
Protein GI: 92112355
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin
TIGRFAM ID:
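
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; both are assumptions that happen to agree with the values listed here, not something the record states. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(246764, 247765)
print(length)                  # 1002
print(protein_length(length))  # 333
```

Both results match the Gene Length and Protein Length fields of this record.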


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.732372
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCAAGT CTTTCACCTT ACTGCTCGGT GCCGCCGCTC TGACGCTGTC CGGTGCGGCA 
CGCGCCGAGG GTCCGATGAG CGTCGTCGCC AGCTTCAGCA TTCTCGGCGA CATGGTCGAG
GAAGTCGGTG GCGAGCATGT CGACGTCACC ACGCTGGTCG GTCCGGACGG CGACGCCCAT
GTCTTCTCCC CCAGCCCCAC CGACGCACGC GCTGTCGGCG AGGCGGACCT GTTCGTCGTC
AACGGGCTGC ATTTCGAAGG CTGGCTGGAC CGCCTGGTGG AAGCCAGCGG CTACGAGGGG
CCGGTGGTCG TGGCAAGCCG GGGCATCGAT GCCCTGAGCT TCGACGAGGA GCGCGAAGAG
CACTCTTCCG ATCATGAAGG TCACGACCAC GCCACAGGCC ACGATCATGA CCACGACCAC
GACCACGACC ACGACCACGA CCACGACCAC AGTGAGCATG CGGGCCACGA CCACGGTCCG
GAAGACCCGC ACGCCTGGCA GGACCTGCAA AACGGCAAGC AGTACGTGGC CAACATCCGC
GACGCGCTCG TCGCGGCAGA CCCCGAGCAT GCCGCTGACT ATCGCCGCAA TGCCGAGCAA
TACGTCGAGG CCATGGATAC GCTGGATGCC GAGGTCCATC GTCGGATCGG CGCGATTCCC
GAGGCCAATC GCGTGCTGAT CACCAACCAC GATGCCTTCG GCTATTTCGC CAACGCCTAT
GGGCTGGACG TGCTCTCGCC GGTCGGCCTC TCCACGGCCG CCGAGCCCAG CGCCGCCGGC
ATGGCCAAGC TGATCGAACA GATCCAGGCA CGCAACGTCA AGGCACTGTT CCTGGAAAAC
ATGACCAGCC CCGCCCTGCT CGAGCAGCTG GCCGACGAAA CCGGGGTGAC CATCGGAGGC
ACGCTCTACG CCGGCGCCCT GGCGGCCGAG GGCGAAGCCA GCACCTACCT CGGCATGTTC
CGTCACAATG TCGATACGCT GACCGAGGCC TTGAAGGACT GA
 
Protein sequence
MSKSFTLLLG AAALTLSGAA RAEGPMSVVA SFSILGDMVE EVGGEHVDVT TLVGPDGDAH 
VFSPSPTDAR AVGEADLFVV NGLHFEGWLD RLVEASGYEG PVVVASRGID ALSFDEEREE
HSSDHEGHDH ATGHDHDHDH DHDHDHDHDH SEHAGHDHGP EDPHAWQDLQ NGKQYVANIR
DALVAADPEH AADYRRNAEQ YVEAMDTLDA EVHRRIGAIP EANRVLITNH DAFGYFANAY
GLDVLSPVGL STAAEPSAAG MAKLIEQIQA RNVKALFLEN MTSPALLEQL ADETGVTIGG
TLYAGALAAE GEASTYLGMF RHNVDTLTEA LKD
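
The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count. The following is a minimal sketch of both calculations in plain Python, not the method the database itself uses. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is built from the standard amino acid string that NCBI tables 1 and 11 share for codon-to-residue assignments.

```python
# Codon table in NCBI order (bases T, C, A, G); tables 1 and 11 share these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above
print(translate("ATGTCCAAGT CTTTCACCTT ACTGCTCGGT"))  # MSKSFTLLLG
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the protein sequence and the GC content field of this record.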