Gene Csal_2416 details

Gene Information

Locus tag: Csal_2416
Symbol:
ID: 4026853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2711581
End bp: 2712834
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 64%
IMG OID: 637967618
Product: extracellular solute-binding protein
Protein accession: YP_574462
Protein GI: 92114534
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
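
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2711581
end_bp = 2712834
gene_length_bp = 1254
protein_length_aa = 417

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", span_bp % 3 == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks print True for this record (1254 bp, 418 codons, 417 residues plus a stop).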


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.230559
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAACCA TCCATGCACT CGCCGCGGGG GCATTGCTGG TCACTGCCGT GGGGCAGGCC 
CAGGCCGCTC AAATCGAGGT ATTGCACTGG TGGACATCGG GGGGCGAAGC GAAAGCCGCC
AACTTGCTCA AGGAAAAGCT CGAGGCCAAG GGGCATACCT GGAAGGACTT CGCGGTTGCG
GGCGGCGCGG GCGACAGTGC CATGACGGTG CTCAAGTCGC GCGCGATTTC CGGCAACCCG
CCGGCGGTGG CCCAGATCAA GGGACCCTTG ATCCAGGAAT GGGGCGAGAT GGGCTTTCTC
GGCAATATCG ACAAGGCCGC CGAAGCCGAT GGCTGGGATG ATTTCCTGCC CCAGGAAATC
GCGGCCTATG ACAAGGTCGA CGGCCATTAC GCCGCGGTGC CGGTCAATAT TCACCGCATC
AACTGGATCT GGGCCAACCC GGAGGTGCTG AAGGCGTCGG GCGTCGAGGA AGTGCCGCAG
ACCTGGGACG CCTTCTTCGA AGCCGCGGAC AAGATTCGCG AGGCGGGCTA CATTCCGCTC
GCTCATGGTG GGCAGCCGTG GCAGGACGCG ACCGTGTTCG AAGTGGTCAT GATGGGCATC
GGCGGCGGCG ACTTCTATCG CAAGGCGTTC GTCGAGCTGG ACCCCGAGGC GCTGACCAGC
GACACCATGA TCGAGTCGCT CGAGACCTTC AAGAAGCTGC GCGGCACGAT GGACGACAAC
ATCGCCGGAC GGGACTGGAA CATCGCCACC TCCATGGTCA TCAACGGCAA GGCGGCCATG
CAGATCATGG GCGACTGGGC CAAGGGCGAG TTCACCGCCG CGGGCATGAC GCCGGGCGAG
GACTACGAAT GTGTCGCACC GCCCATGACG GAGCACATGT TTTCGTACAA CACCGACAGC
CTGGCGATGT TCGACGTCGA CGACGCAGGC CAGCAGCAGG CCCAGCTGGA TCTTGCGAGC
CTGGTGCTGT CGCCCGACTT CCAGGCCAGC TTCAACCAGG CCAAGGGCTC GATTCCGGTG
CGCCTGGACG TGCCGCTCGA CGACTTCGAC GCATGCGCCA AGGCATCTCG CGAGGCCTTC
GATGTCGCCA TGGACGAAGG CGGACTGGTG CCCAGCCTGG CACACGGCAT GGCGGTATCG
GACAGCCAGC AGGGCGCGGT GTTCGATGTC ATCACCAACT TCTTCAATGA CCCCGACATG
ACGGCCGAAA CGGCCGCCGA ACGTCTGGTC AGCGCGGTGC GCGCGGCCGA GTGA
 
Protein sequence
MKTIHALAAG ALLVTAVGQA QAAQIEVLHW WTSGGEAKAA NLLKEKLEAK GHTWKDFAVA 
GGAGDSAMTV LKSRAISGNP PAVAQIKGPL IQEWGEMGFL GNIDKAAEAD GWDDFLPQEI
AAYDKVDGHY AAVPVNIHRI NWIWANPEVL KASGVEEVPQ TWDAFFEAAD KIREAGYIPL
AHGGQPWQDA TVFEVVMMGI GGGDFYRKAF VELDPEALTS DTMIESLETF KKLRGTMDDN
IAGRDWNIAT SMVINGKAAM QIMGDWAKGE FTAAGMTPGE DYECVAPPMT EHMFSYNTDS
LAMFDVDDAG QQQAQLDLAS LVLSPDFQAS FNQAKGSIPV RLDVPLDDFD ACAKASREAF
DVAMDEGGLV PSLAHGMAVS DSQQGAVFDV ITNFFNDPDM TAETAAERLV SAVRAAE
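
The sequences above are displayed in blocks of 10 characters, so the spaces have to be removed before any computation. The following is a minimal sketch, not the database's own tooling, of how the GC content and the translation could be reproduced. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon table is the standard one; translation table 11 uses the same codon-to-amino-acid assignments and differs only in the permitted start codons. Only the first displayed line of the gene sequence is used as sample input here.

```python
# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop display whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N; this sketch does not handle them.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First displayed line of the gene sequence (60 nt).
first_line = "ATGAAAACCA TCCATGCACT CGCCGCGGGG GCATTGCTGG TCACTGCCGT GGGGCAGGCC"

print(translate(first_line))  # MKTIHALAAGALLVTAVGQA
print(f"GC fraction of this fragment: {gc_fraction(first_line):.2f}")
```

The translated fragment matches the first 20 residues of the protein sequence shown above. Pasting the full gene sequence into `first_line` would allow the same comparison against the complete protein sequence and the reported GC content.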