Gene Csal_2999 details

Gene Information

Locus tag: Csal_2999
Symbol:
ID: 4028965
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 3336863
End bp: 3338512
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 64%
IMG OID: 637968205
Product: extracellular solute-binding protein
Protein accession: YP_575042
Protein GI: 92115114
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
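
The coordinate and length fields of this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are inclusive and that the CDS is complete and ends in a single untranslated stop codon (the gene sequence listed in this record ends in TAA). All variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3336863
end_bp = 3338512
gene_length_bp = 1650
protein_length_aa = 549

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: a complete CDS is a whole number of codons and the final
# codon is a stop codon that does not appear in the protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 1650 bp is 550 codons, which is 549 residues plus the stop codon.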


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.305947
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCCAC TCGCGGCATC CGACACAACA ACAAGCGGAA CAGACGACAT GCTTCGACAC 
TCGCACACTC GGCTTGCCAC GACACTGCTA TCCTCCCTGT CGCTGGCGTC ACTGGCCCTG
GCGGCGCCGG CCCAGGCCGA CACGCTGAAC ATCGGCGTCA TGGGGGAGCT GGCCTCGTTC
GATACCTCGC AGGTTTCCGG CGGTGTCTGG GAATCGCAGA TCCTCATGGA CGTCTACGAA
GGCCTGCTCA AGGAAAACCC CGAGGGCGAG GTGATGCCCG GCATGGCCAC CGACTGGGAC
ATCTCCGAGG ACGGCAGGAC CTATACCTTT CATCTGCGCG AAGGCGCCAA ATGGTCCGAC
GGCGCGCCGG TCACCGCCGA AGACTTCGTG TTCGGCTGGC AGCATCTGCT CGACCCGGCC
AGCGCTTCGA AATACGCCTA CCTGCTCTAT CCGATAAAGA ATGCCGAAGC CGTCAACACG
GGCGACAGGC CGCTCGACGC ACTCGGCGTC GAATCGCTGG ACGATGGCAA GACGCTCAAG
GTGACACTCG ATGCCCCCAC GCCCTACTTC CTGCAGCTGC TGACCCACTA CACGGCCTAC
CCGGTCCCCA AGCACGCCGT CGAGAAATAC GGCAAGCAGT GGGTCAAGAT GGACAACATC
GTCACCAACG GCGCCTTCAC GCCCGTCGAG TGGGTCTCGC AATCGCGCAT CAGCGTCGAG
AAGAATCCCG ACTACTACGA GGCCGACGAG GTCGAACTCG ACGGCGTCAA CTACTTCAAC
ACCGAGGATC GCAATGCCGC CATCTCGCGC TTCCGCGCCG GCGAGCTGGA CATCGTCCGC
GATTACCCCT CGAGCCGTTA CCAGTGGCTC GAGGACAACC TCCCCGAGGC CACCCACCTG
AGCCCGATGC TGGGGTCCTA CTACTACGTG CTCAATACCC GCGAGGGGCG CCCGACCGCC
GACAAGCGGG TCCGCGAGGC CCTGAACCTG GTCGCGCGCC GCAAGGTACT TTCCGAGCAG
ATCATGGCCG GCAGTTTCAA GGATGCCTAC TCGCTGGTCC CGCCGGGCAC CAGCCATTAC
GACGTCCAGC GCATGGACGG TGTCGATGGC GACTACCAGA AGCGCCTGGC CAGGGCCAAG
CAACTGATCG AGGAGGCCGG CTACGGCCCC GACAACCCGC TGCACCTGCA ACTGCGCTAC
AACACGTCCG ATGAGCACAA GAAGATCGCC ATCGCCCTGG CCGCGATGTG GAAGCCGCTG
GGTGTCGACG TCGAGATGAC CAATGCCGAG GCCACCGTGC ACTACCAGAC CATCCAGCAA
GGCGACTTCG ATATCGCCCG TGCCGGCTGG ATCGCCGACT ACAACGATGC CGAGAACTTC
CTGACCTTGC TGCGCAGCGG CGTCGGCAAC AACTACGGCG GCTACGCCAA TCCCGAGTAC
GACGCGCTGC TCGCTCAAGC CGCCACCGTT CGGGACCTCG ACGAGCGCGA GGCACTGCTC
GAAAAAGCCG AGAACGTCGC CCTCGACGAC TACGCCCTCG TGCCGCTGCT CTATTACGTC
ACTCGCAATC TGGTCAATCC CGATATCAGC GGCTGGCAGG ACAACGCCGA GGACGACCAT
CCATCGCGCT GGGTGACGTT CACCGAGTAA
 
Protein sequence
MSPLAASDTT TSGTDDMLRH SHTRLATTLL SSLSLASLAL AAPAQADTLN IGVMGELASF 
DTSQVSGGVW ESQILMDVYE GLLKENPEGE VMPGMATDWD ISEDGRTYTF HLREGAKWSD
GAPVTAEDFV FGWQHLLDPA SASKYAYLLY PIKNAEAVNT GDRPLDALGV ESLDDGKTLK
VTLDAPTPYF LQLLTHYTAY PVPKHAVEKY GKQWVKMDNI VTNGAFTPVE WVSQSRISVE
KNPDYYEADE VELDGVNYFN TEDRNAAISR FRAGELDIVR DYPSSRYQWL EDNLPEATHL
SPMLGSYYYV LNTREGRPTA DKRVREALNL VARRKVLSEQ IMAGSFKDAY SLVPPGTSHY
DVQRMDGVDG DYQKRLARAK QLIEEAGYGP DNPLHLQLRY NTSDEHKKIA IALAAMWKPL
GVDVEMTNAE ATVHYQTIQQ GDFDIARAGW IADYNDAENF LTLLRSGVGN NYGGYANPEY
DALLAQAATV RDLDEREALL EKAENVALDD YALVPLLYYV TRNLVNPDIS GWQDNAEDDH
PSRWVTFTE