Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1020 |
Symbol | araG |
ID | 4027866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1150794 |
End bp | 1152287 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637966197 |
Product | L-arabinose transporter ATP-binding protein |
Protein accession | YP_573076 |
Protein GI | 92113148 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.537406 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGAGG CTTTCTTACG CTTCGATGGA ATCAGCGTCG AGTTTCCCGG CGTCAAGGCC CTCGACGAGG TGAGCTTCTC CGCGCGTGCC GGGGAAGTGC ATGCCTTGAT GGGGGAAAAC GGCGCCGGCA AGTCGACGCT GCTCAAGGTG CTCAGCGGCG TCAATCGGCC CTCGTCGGGC CAGCTGTGGA TCGACGGCCA AGCGCATGTC TTCGCCAATG CGCGCGAGGC GCTGGCGCAC GGTATCGCGA TCATCTACCA GGAACTCACG CTGTCACCCA ATTTGTCGGT GGCCGAAAAC CTGTTGTTGG GACAGTTGCC CGAGCGCCGG GGGTTCATCG ACCGACGAAC CATGAAGGCA CGTGCCCGCG AGATTCTCGA GGAGCTGGGC GAGGGCGACA TCGACCCGGC GACCAAGGTG CGGGAGCTCT CCATCGGGCA GCAGCAGATG ATCGAGATTG GTCGGGCGTT GCTGCGCGAC GCGCGGATCA TCGCCTTCGA CGAGCCGACC AGCAGTCTCT CCGTGCAAGA AACGCGGCAG CTCAAGCGCA TCGTCGCACG ACTGCGTGAC GAGGGGCGTG TGGTGCTGTA CGTCACCCAT CGCATGGAAG AGGTCTTCGA GATGTGCGAT GCGGTGACCG TGTTCCGTGA TGGTCGCCAC ATTCGCACCC ACGAGACGCT GGAAGGGCTC GATCACGACA TGCTGGTCGG CGAGATGGTC GGCCGACAGA TCGACGACGT GTATGGCTTC CGTCCACGCG ACATCGGCGA TGTGCTGATG CGTATCGACG GCCTTCAGGG GCGCGGGGTC AACGAACCGG TCAATCTGGA GGTGCGACGC GGCGAGGTAC TGGGGTTGTT CGGCCTGGTG GGGGCAGGGC GTAGCGAGCT GATGCGACTG GTCTGCGGCG TGGAAAAGGC CAGCCGCGGG CAGGTCGCGT TGCGGGGCGA GACGCGTGTC TTTGCCTCGC CGCATCAAGC GATCCGCGCA GGCATCGCGA TGTGTCCGGA GGACCGCAAG TCCCAGGGGA TCTTCCCCGT GGCCAGCGTC TCCGACAACC TCAACATCAG TTGCCGGCGT TTTTTCCGTC GCTGGGGCAT GTTCCGGCAC GCCGCACGCG AGACCGACAA CGCCAAGACC TACATTCAGC GCCTGAGCAT CAAGACGCCG AGCCATCGCA CGCCGATCAA TACCTTGTCC GGCGGCAACC AGCAGAAAGT GATTCTCGGT CGCTGGCTGG CCGAGGAGAT CGACCTGTTC GTGATGGACG AACCCACGCG TGGCATCGAC GTGGGGGCGC GTCGCGATAT CTACGCCTTG TTGTACGACC TGGCGGAGCA GGGCAAGGGG GTGATCGTGA TCTCCAGCGA CCTCGCCGAG GTCAGCTCGA TCTGCGATCG CATCGGCGTC ATGCGTGACG GCGTCCTGGT CGACATCGTA CCGCGCGAGC AGGCAACCCA GGCGCGTCTG CTCGGTCTGG CCTTGCCCGC ATGA
|
Protein sequence | MSEAFLRFDG ISVEFPGVKA LDEVSFSARA GEVHALMGEN GAGKSTLLKV LSGVNRPSSG QLWIDGQAHV FANAREALAH GIAIIYQELT LSPNLSVAEN LLLGQLPERR GFIDRRTMKA RAREILEELG EGDIDPATKV RELSIGQQQM IEIGRALLRD ARIIAFDEPT SSLSVQETRQ LKRIVARLRD EGRVVLYVTH RMEEVFEMCD AVTVFRDGRH IRTHETLEGL DHDMLVGEMV GRQIDDVYGF RPRDIGDVLM RIDGLQGRGV NEPVNLEVRR GEVLGLFGLV GAGRSELMRL VCGVEKASRG QVALRGETRV FASPHQAIRA GIAMCPEDRK SQGIFPVASV SDNLNISCRR FFRRWGMFRH AARETDNAKT YIQRLSIKTP SHRTPINTLS GGNQQKVILG RWLAEEIDLF VMDEPTRGID VGARRDIYAL LYDLAEQGKG VIVISSDLAE VSSICDRIGV MRDGVLVDIV PREQATQARL LGLALPA
|
| |