Gene Csal_2093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2093 
Symbol 
ID4029236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2362353 
End bp2363453 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content55% 
IMG OID637967292 
ProductABC transporter related 
Protein accessionYP_574143 
Protein GI92114215 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGAGA TCACGCTCAA GTCCCTGGCG CATAGTTATT CGAAGAATCC CGCAACAGCA 
CAGGATTATG CCATACGCGA GATGAATCAT GTCTGGCATC AGGGCGGTGC CTATGCGCTG
CTTGGCCCTT CGGGGTGTGG CAAGTCGACC ATGCTCAATA TCATTTCAGG CTTGCTCGAG
CCATCGAACG GCGAGGTGCT CTTCGATGGC AAGGTTGTCA ATTCGTTGCC GCCAGAAGAA
CGCAATATTG CCCAGGTTTT TCAGTTTCCT GTCATCTACG ACACCATGAC GGTATACGAC
AACCTGGCAT TTCCATTGCG TAACATGAAA ACGCCGGAAG CTCGTGTTCA TGAACGTGTC
ATGGAAGTGG CCGATGTACT GGAGTTGACG CCCCAGCTCA AGCGCAAGGC CAAGAATTTG
AATGCCGATG AGAAACAGAA AGTTTCCATG GGGCGAGGTC TGGTACGCGA TGATGTGTCG
GCGATCCTGT TTGATGAGCC GCTGACGGTC ATCGACCCCC AGCTCAAGTG GAAGCTGCGG
CGCAAGCTCA AGGAAATTCA CCATCGCTTC CGCATCACGA TGGTCTATGT GACCCACGAC
CAGCTAGAGG CGTCGACGTT CGCTGACAAG ATAGCCGTGA TGTACGAGGG GCAGGTCGTG
CAGTTCGGCA CCCCCCGAGA GTTGTTCGAG ACGCCAAACC ACACCTTTGT CGGCTACTTC
ATCGGTAGCC CGGGAATGAA TTTTCTAGAC GTCACGGTAC GTGATGGCAA GGTGTTTTCC
GGTGAGGCGC CGCTTTTAGT CGACAAGCGA GTTTACGACG GCATCGCGCG TGCCACCTCT
GACAACGTCA AGCTGGGAAT TCGCCCCGAG TTCATCGAGG TGCACCAGGA GCCGGTCGAG
GACGGCATGG CAGTGAAGGT GGAGCATGCC CAGGATCTGG GGACCTATTC GATCGTGTAT
GCCTCACTCG GTGGTATCTC GCTGAAGGCG CGTGTCGAGG AAGATCAGAT AACGCCGGTC
GATAAAGCCT GGCTACGCTT CCCGAGCGAG AAGATCGGGT TATATGTCGA TGACTTCCGG
GTGGAGCTTC CTCATGAATA A
 
Protein sequence
MAEITLKSLA HSYSKNPATA QDYAIREMNH VWHQGGAYAL LGPSGCGKST MLNIISGLLE 
PSNGEVLFDG KVVNSLPPEE RNIAQVFQFP VIYDTMTVYD NLAFPLRNMK TPEARVHERV
MEVADVLELT PQLKRKAKNL NADEKQKVSM GRGLVRDDVS AILFDEPLTV IDPQLKWKLR
RKLKEIHHRF RITMVYVTHD QLEASTFADK IAVMYEGQVV QFGTPRELFE TPNHTFVGYF
IGSPGMNFLD VTVRDGKVFS GEAPLLVDKR VYDGIARATS DNVKLGIRPE FIEVHQEPVE
DGMAVKVEHA QDLGTYSIVY ASLGGISLKA RVEEDQITPV DKAWLRFPSE KIGLYVDDFR
VELPHE