Gene Csal_1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1020 
SymbolaraG 
ID4027866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1150794 
End bp1152287 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content64% 
IMG OID637966197 
ProductL-arabinose transporter ATP-binding protein 
Protein accessionYP_573076 
Protein GI92113148 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.537406 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGAGG CTTTCTTACG CTTCGATGGA ATCAGCGTCG AGTTTCCCGG CGTCAAGGCC 
CTCGACGAGG TGAGCTTCTC CGCGCGTGCC GGGGAAGTGC ATGCCTTGAT GGGGGAAAAC
GGCGCCGGCA AGTCGACGCT GCTCAAGGTG CTCAGCGGCG TCAATCGGCC CTCGTCGGGC
CAGCTGTGGA TCGACGGCCA AGCGCATGTC TTCGCCAATG CGCGCGAGGC GCTGGCGCAC
GGTATCGCGA TCATCTACCA GGAACTCACG CTGTCACCCA ATTTGTCGGT GGCCGAAAAC
CTGTTGTTGG GACAGTTGCC CGAGCGCCGG GGGTTCATCG ACCGACGAAC CATGAAGGCA
CGTGCCCGCG AGATTCTCGA GGAGCTGGGC GAGGGCGACA TCGACCCGGC GACCAAGGTG
CGGGAGCTCT CCATCGGGCA GCAGCAGATG ATCGAGATTG GTCGGGCGTT GCTGCGCGAC
GCGCGGATCA TCGCCTTCGA CGAGCCGACC AGCAGTCTCT CCGTGCAAGA AACGCGGCAG
CTCAAGCGCA TCGTCGCACG ACTGCGTGAC GAGGGGCGTG TGGTGCTGTA CGTCACCCAT
CGCATGGAAG AGGTCTTCGA GATGTGCGAT GCGGTGACCG TGTTCCGTGA TGGTCGCCAC
ATTCGCACCC ACGAGACGCT GGAAGGGCTC GATCACGACA TGCTGGTCGG CGAGATGGTC
GGCCGACAGA TCGACGACGT GTATGGCTTC CGTCCACGCG ACATCGGCGA TGTGCTGATG
CGTATCGACG GCCTTCAGGG GCGCGGGGTC AACGAACCGG TCAATCTGGA GGTGCGACGC
GGCGAGGTAC TGGGGTTGTT CGGCCTGGTG GGGGCAGGGC GTAGCGAGCT GATGCGACTG
GTCTGCGGCG TGGAAAAGGC CAGCCGCGGG CAGGTCGCGT TGCGGGGCGA GACGCGTGTC
TTTGCCTCGC CGCATCAAGC GATCCGCGCA GGCATCGCGA TGTGTCCGGA GGACCGCAAG
TCCCAGGGGA TCTTCCCCGT GGCCAGCGTC TCCGACAACC TCAACATCAG TTGCCGGCGT
TTTTTCCGTC GCTGGGGCAT GTTCCGGCAC GCCGCACGCG AGACCGACAA CGCCAAGACC
TACATTCAGC GCCTGAGCAT CAAGACGCCG AGCCATCGCA CGCCGATCAA TACCTTGTCC
GGCGGCAACC AGCAGAAAGT GATTCTCGGT CGCTGGCTGG CCGAGGAGAT CGACCTGTTC
GTGATGGACG AACCCACGCG TGGCATCGAC GTGGGGGCGC GTCGCGATAT CTACGCCTTG
TTGTACGACC TGGCGGAGCA GGGCAAGGGG GTGATCGTGA TCTCCAGCGA CCTCGCCGAG
GTCAGCTCGA TCTGCGATCG CATCGGCGTC ATGCGTGACG GCGTCCTGGT CGACATCGTA
CCGCGCGAGC AGGCAACCCA GGCGCGTCTG CTCGGTCTGG CCTTGCCCGC ATGA
 
Protein sequence
MSEAFLRFDG ISVEFPGVKA LDEVSFSARA GEVHALMGEN GAGKSTLLKV LSGVNRPSSG 
QLWIDGQAHV FANAREALAH GIAIIYQELT LSPNLSVAEN LLLGQLPERR GFIDRRTMKA
RAREILEELG EGDIDPATKV RELSIGQQQM IEIGRALLRD ARIIAFDEPT SSLSVQETRQ
LKRIVARLRD EGRVVLYVTH RMEEVFEMCD AVTVFRDGRH IRTHETLEGL DHDMLVGEMV
GRQIDDVYGF RPRDIGDVLM RIDGLQGRGV NEPVNLEVRR GEVLGLFGLV GAGRSELMRL
VCGVEKASRG QVALRGETRV FASPHQAIRA GIAMCPEDRK SQGIFPVASV SDNLNISCRR
FFRRWGMFRH AARETDNAKT YIQRLSIKTP SHRTPINTLS GGNQQKVILG RWLAEEIDLF
VMDEPTRGID VGARRDIYAL LYDLAEQGKG VIVISSDLAE VSSICDRIGV MRDGVLVDIV
PREQATQARL LGLALPA