Gene Csal_1517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1517 
Symbol 
ID4029213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1725863 
End bp1726813 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content63% 
IMG OID637966700 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_573569 
Protein GI92113641 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID[TIGR03414] choline ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00127851 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAACA AGACGTCTTT GATCGCGCTG CTGGGCGCCA CTACATTGCT TTCACCCCTG 
GCCATGGCAA ATACGCCGGA AAGCTGCCAG CCGGTACGCT TCGCCGAAGT CGGCTGGACC
GACATCACCG CCACCACCGC CTTGACCCGT GAGGTGCTCG AGGGCCTCGG TTACGAGACG
ACGTCCAACA CCGTCTCCGT GCCGGTGGCC TACGCCGGGA TGAAGAACGG CGACTTCGAC
GTGTTCCTGG GCAACTGGAT GCCGTCGATG GCCTCGATCA GCGACGAGTA TATCGACAAG
GGACAGGTGG ATCGTCTCGG CGCCAACCTG GAGGGGGCCA AGTACACCCT GGCGGTGCCG
CAGTACGTCT ACGATGCCGG CGTGACCTCG GTCGAAGACC TGGACGCGCA TGCGGACAAG
TTCGATAGCC GCCTGTACGG CATCGAAGCG GGCAACGACG GCAATCAGAT CATCCAGCAG
ATGATCGATG ACGATGCCTT CGGGCTGGGC GACTGGAGCC TGATCGACTC GTCCGAATCG
GGCATGCTCG CCGAACTCAA CTCCCGCACC CAGAGCGAGG AATGGATGGT GTTCCTGGGG
TGGGAGCCGC ACCCGATGAA CACCAACTAC GAAATGGCCT ATCTGGAAGG CGCCGATGAC
TACTTCGGTC CCAACCTCGG CGGCGCGACC GTGTATACCA ACACGCGTGC CGGGTACGCA
GAGGCCTGCG GTAACGTGGG CGAGCTGCTC AACAACCTGA GCTTCACGCT GTCCATGGAA
AACGAGATCA TGGGCGCCAT CATGGACGAC GGCGAGGATC CGCGCGATGC GGCGCGCACC
TGGCTGCAGA ACAACCCGTC CGTCCTCGAT GAGTGGCTCC AGGGCGTGAC CACCGTCGAA
GGCGAGCCCG GGCTGGCGGC CGTGAAGAAA GCGCTGGACA TCGACAGCTG A
 
Protein sequence
MSNKTSLIAL LGATTLLSPL AMANTPESCQ PVRFAEVGWT DITATTALTR EVLEGLGYET 
TSNTVSVPVA YAGMKNGDFD VFLGNWMPSM ASISDEYIDK GQVDRLGANL EGAKYTLAVP
QYVYDAGVTS VEDLDAHADK FDSRLYGIEA GNDGNQIIQQ MIDDDAFGLG DWSLIDSSES
GMLAELNSRT QSEEWMVFLG WEPHPMNTNY EMAYLEGADD YFGPNLGGAT VYTNTRAGYA
EACGNVGELL NNLSFTLSME NEIMGAIMDD GEDPRDAART WLQNNPSVLD EWLQGVTTVE
GEPGLAAVKK ALDIDS