Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1903 |
Symbol | |
ID | 4026816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2161109 |
End bp | 2162326 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637967097 |
Product | glycine betaine/L-proline transport ATP binding subunit |
Protein accession | YP_573954 |
Protein GI | 92114026 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4175] ABC-type proline/glycine betaine transport system, ATPase component |
TIGRFAM ID | [TIGR01186] glycine betaine/L-proline transport ATP binding subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0888072 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAA CGGATGAAAC CAACGAAAGC AATATCAAGA TTCAGGTACG CGGCCTGAGC AAGGTGTTTG GCTCCCAGCC AAAGAAAGCG CTCGAATTGC GCAATCAGGG AAAGAAGCGT CCCGAGATCC TCGAAAAAAC CGGCCAGACG CTGGGGCTTT CGAACATCGA CTTCGATGTG CGTGAGGGCG AGCTCCTCGT CATCATGGGG TTGTCGGGGT CCGGCAAGTC GACGCTGATC CGCTGTCTCA ACCGACTGAT CGAACCCACC GAAGGCGACA TCATCATCGA TGGTCAGAAC ATTCCCAAGC TCAACGAGAA AGAGCTGCTG GAATGTCGCC GTCGCCACTT TTCGATGGTA TTCCAGAACT TCGCGCTGTT TCCGCACCGT ACGGTGCAGG AGAACGCCGA GTACGGCCTC GAAGTTCGCG GTATCGAAAA ATCCTCGCGT GTCGAAAGCG CGCGTAACTC CCTCAAGCAA GTCGGCCTGG AAGGGTGGGA AGACGCCTAT CCGAACCAGC TTTCCGGCGG CATGCAGCAG CGGGTCGGTC TGGCACGCGC GTTGGCCAAC GACTCCACCG TGATGCTGAT GGACGAAGCC TTCTCGGCGC TGGATCCGTT GATCCGCAAG GATATGCAGC AAGAGCTGAT CGAACTGCAG CATCGCATGA AGAAGACCAC GATCTTCATT ACCCACGACC TCGACGAAGC GATCAGCATC GGTGACCGCA TCATCCTTCT CAAGGATGGC GAGATCGTCC AGAGCGGCAC GCCCGAAGAG ATTCTGACGC GTCCCGCCGA CGATTACGTG GCTCGCTTCG TCGAGGGCGT GGACATGTCA CGCGTGCTGA CAGCCACCAG TGCCATGCGC CCCGTGCGCG CGACGGCGCG CGACAGCGAC GGTCCCCGTA CCGTGCTGCG CAAGATGAGC GACAATGGAC TCGATTCCAT CTATGTCATC GGGCGTGATC GCACCTTGCT GGGTATCGTC GGGGTCGACG ATGTCGATGC GGCCGCCAAG GCCGGCAAGG ATACGATTCA CGAGTTGATC CACGATGACT TCCCGAAAGC CGGGCCGGAT GAACCGATGA ACAATCTCTT TGCCATGTTC AGCGAGAAAA GTTACCCGAT CGCCATCGTC GACGAAAATC AACGCCTGCT GGGCGTCGTC GTGAAGGGCG CGGTACTCGA ACAACTGGCT GAAGCGGGAG AGCACTGA
|
Protein sequence | MTETDETNES NIKIQVRGLS KVFGSQPKKA LELRNQGKKR PEILEKTGQT LGLSNIDFDV REGELLVIMG LSGSGKSTLI RCLNRLIEPT EGDIIIDGQN IPKLNEKELL ECRRRHFSMV FQNFALFPHR TVQENAEYGL EVRGIEKSSR VESARNSLKQ VGLEGWEDAY PNQLSGGMQQ RVGLARALAN DSTVMLMDEA FSALDPLIRK DMQQELIELQ HRMKKTTIFI THDLDEAISI GDRIILLKDG EIVQSGTPEE ILTRPADDYV ARFVEGVDMS RVLTATSAMR PVRATARDSD GPRTVLRKMS DNGLDSIYVI GRDRTLLGIV GVDDVDAAAK AGKDTIHELI HDDFPKAGPD EPMNNLFAMF SEKSYPIAIV DENQRLLGVV VKGAVLEQLA EAGEH
|
| |