Gene Csal_0219 details

Gene Information

Locus tag: Csal_0219
Symbol:
ID: 4027302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 245720
End bp: 246688
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 65%
IMG OID: 637965370
Product: ABC-type glycine betaine transport system protein
Protein accession: YP_572282
Protein GI: 92112354
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components
TIGRFAM ID: [TIGR03414] choline ABC transporter, periplasmic binding protein
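
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 245720
end_bp = 246688
gene_length_bp = 969
protein_length_aa = 322

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not count as a residue.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop matches protein length:", coding_codons == protein_length_aa)
```

Under these assumptions, all three checks hold for this record.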


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGACTG CATCCCACCC GCTGGCGCGC GGCACCAAGG CGGCGCTGCT TGGTGCCTGC 
CTGGCCGGCC TGCCGCTGGG CATGGCGCAG GCGCAGGACG ACACGACGGA CGTGCGCTTT
TCCGTACCAC CGTGGCCGGG CGTGACGGTC AAGACCGAAC TCGCCGCGCA GTTGCTCGAT
ACGCTGGGGT ATACGCCGCA GCAGGAGCAG CTCGGCACCA CCATCACCTA CCAGGCGCTC
AACCAGAACG AGTTGGACGC CTTCCTCGCC GGATGGCTGC CCGCCCAGCA GGGCATGTAC
GACACCGCCC TGGAAAAGGG CAAGCTGGTC GATCTGGGCA ACAACGTCGA TGGCGCGCGC
ATCGGCTTCG CGGTGCCCAG CTACGTCTTC GATGCCGGCG TCACCTCCGC CGAGGATCTC
GACACGCCGG AAAACGCCGA GCGTTTCGGG CGTACCGTCT ACTCCATCGA GACCGGCACG
GGCATGAGCG AGCAGCTCAA TGCCGGCGTC GCCAGCGATA CTTACGGCCT GGGCGACTGG
GAGCTTTCCG AGACATCCAC GCCGGGCATG CTCGGCGCCG CCGACAGTGC CATCGACAAC
CAGGAGTGGA TCGTCTTCGC CGGCTGGACG CCGCACTGGA TGAACATCAA GTACGACATC
GCCTATCTCG ACGACCCCGA GGACTTGTGG GGAGAGGACG GCGGTCGCAG CGACGTGCGC
ACCCTGGTCA CGAAGACATT CTCCGAGACG CACCCCAATG CCACCAGGTT GCTCGATCAA
CTGGACTTCA CCGCCGACGA CCAGAGCGAC ATGATCCGTC GCTACGATCA GGACGGGATG
CCCAAGGACG AAGCTGCCAT CGCCTGGATG CGCGACAACG CCGACAAGGT GGAAGGCTTT
GTCGATGGCG TCACCACGCG TGACGGCGAG CCTGCCTGGC CGGTGGTGAA AGAAGCGTTC
GACCTGTAG
 
Protein sequence
MMTASHPLAR GTKAALLGAC LAGLPLGMAQ AQDDTTDVRF SVPPWPGVTV KTELAAQLLD 
TLGYTPQQEQ LGTTITYQAL NQNELDAFLA GWLPAQQGMY DTALEKGKLV DLGNNVDGAR
IGFAVPSYVF DAGVTSAEDL DTPENAERFG RTVYSIETGT GMSEQLNAGV ASDTYGLGDW
ELSETSTPGM LGAADSAIDN QEWIVFAGWT PHWMNIKYDI AYLDDPEDLW GEDGGRSDVR
TLVTKTFSET HPNATRLLDQ LDFTADDQSD MIRRYDQDGM PKDEAAIAWM RDNADKVEGF
VDGVTTRDGE PAWPVVKEAF DL
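
The gene and protein sequences above are related by translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. The following is a minimal sketch of how the two could be compared and how a GC fraction could be computed. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first and last few codons from the record are used as a demonstration.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; table 11 shares these
# assignments and differs from table 1 only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks and final line of the gene sequence listed above.
first_blocks = "ATGATGACTG CATCCCACCC GCTGGCGCGC"
last_codons = "GACCTGTAG"

print(translate(first_blocks))  # compare with the start of the protein sequence
print(translate(last_codons))   # compare with the end of the protein sequence
print(round(gc_fraction(first_blocks), 3))
```

Pasting the full gene sequence into `translate` and `gc_fraction` would allow comparison with the listed protein sequence and the reported GC content of 65%.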