Gene Csal_1903 details

Gene Information

Locus tag: Csal_1903
Symbol:
ID: 4026816
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2161109
End bp: 2162326
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 59%
IMG OID: 637967097
Product: glycine betaine/L-proline transport ATP binding subunit
Protein accession: YP_573954
Protein GI: 92114026
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4175] ABC-type proline/glycine betaine transport system, ATPase component
TIGRFAM ID: [TIGR01186] glycine betaine/L-proline transport ATP binding subunit
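
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2161109, 2162326)
print(length)                     # 1218
print(protein_length_aa(length))  # 405
```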


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0888072
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAAA CGGATGAAAC CAACGAAAGC AATATCAAGA TTCAGGTACG CGGCCTGAGC 
AAGGTGTTTG GCTCCCAGCC AAAGAAAGCG CTCGAATTGC GCAATCAGGG AAAGAAGCGT
CCCGAGATCC TCGAAAAAAC CGGCCAGACG CTGGGGCTTT CGAACATCGA CTTCGATGTG
CGTGAGGGCG AGCTCCTCGT CATCATGGGG TTGTCGGGGT CCGGCAAGTC GACGCTGATC
CGCTGTCTCA ACCGACTGAT CGAACCCACC GAAGGCGACA TCATCATCGA TGGTCAGAAC
ATTCCCAAGC TCAACGAGAA AGAGCTGCTG GAATGTCGCC GTCGCCACTT TTCGATGGTA
TTCCAGAACT TCGCGCTGTT TCCGCACCGT ACGGTGCAGG AGAACGCCGA GTACGGCCTC
GAAGTTCGCG GTATCGAAAA ATCCTCGCGT GTCGAAAGCG CGCGTAACTC CCTCAAGCAA
GTCGGCCTGG AAGGGTGGGA AGACGCCTAT CCGAACCAGC TTTCCGGCGG CATGCAGCAG
CGGGTCGGTC TGGCACGCGC GTTGGCCAAC GACTCCACCG TGATGCTGAT GGACGAAGCC
TTCTCGGCGC TGGATCCGTT GATCCGCAAG GATATGCAGC AAGAGCTGAT CGAACTGCAG
CATCGCATGA AGAAGACCAC GATCTTCATT ACCCACGACC TCGACGAAGC GATCAGCATC
GGTGACCGCA TCATCCTTCT CAAGGATGGC GAGATCGTCC AGAGCGGCAC GCCCGAAGAG
ATTCTGACGC GTCCCGCCGA CGATTACGTG GCTCGCTTCG TCGAGGGCGT GGACATGTCA
CGCGTGCTGA CAGCCACCAG TGCCATGCGC CCCGTGCGCG CGACGGCGCG CGACAGCGAC
GGTCCCCGTA CCGTGCTGCG CAAGATGAGC GACAATGGAC TCGATTCCAT CTATGTCATC
GGGCGTGATC GCACCTTGCT GGGTATCGTC GGGGTCGACG ATGTCGATGC GGCCGCCAAG
GCCGGCAAGG ATACGATTCA CGAGTTGATC CACGATGACT TCCCGAAAGC CGGGCCGGAT
GAACCGATGA ACAATCTCTT TGCCATGTTC AGCGAGAAAA GTTACCCGAT CGCCATCGTC
GACGAAAATC AACGCCTGCT GGGCGTCGTC GTGAAGGGCG CGGTACTCGA ACAACTGGCT
GAAGCGGGAG AGCACTGA
 
Protein sequence
MTETDETNES NIKIQVRGLS KVFGSQPKKA LELRNQGKKR PEILEKTGQT LGLSNIDFDV 
REGELLVIMG LSGSGKSTLI RCLNRLIEPT EGDIIIDGQN IPKLNEKELL ECRRRHFSMV
FQNFALFPHR TVQENAEYGL EVRGIEKSSR VESARNSLKQ VGLEGWEDAY PNQLSGGMQQ
RVGLARALAN DSTVMLMDEA FSALDPLIRK DMQQELIELQ HRMKKTTIFI THDLDEAISI
GDRIILLKDG EIVQSGTPEE ILTRPADDYV ARFVEGVDMS RVLTATSAMR PVRATARDSD
GPRTVLRKMS DNGLDSIYVI GRDRTLLGIV GVDDVDAAAK AGKDTIHELI HDDFPKAGPD
EPMNNLFAMF SEKSYPIAIV DENQRLLGVV VKGAVLEQLA EAGEH
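
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The following minimal sketch shows one way to do that, compute a GC fraction, and translate a coding sequence. The helper names are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares for internal codons. The sketch does not handle the alternative start codons that table 11 permits, which does not matter for this gene because it begins with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGACCGAAA CGGATGAAAC CAACGAAAGC"
print(translate(first_blocks))  # MTETDETNES
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be compared against the listed values.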