Gene Csal_0137 details

Gene Information

Locus tag: Csal_0137
Symbol:
ID: 4026589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 158478
End bp: 159761
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 58%
IMG OID: 637965288
Product: major facilitator transporter
Protein accession: YP_572200
Protein GI: 92112272
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
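
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 158478
END_BP = 159761
GENE_LENGTH_BP = 1284
PROTEIN_LENGTH_AA = 427


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming one terminal stop codon in the CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks are consistent with the record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```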


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0711872
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGACGACAA CGACTGCATC TTCCCAATCA TCGGCACCGG CAGCCGAGAC ACCTGACGAC 
GTACAACACG CAGGCATCAA ACGCTGGGCG ATTCCTTTTG CCTTGCTTTG CTGCGTCATT
CTTTCCTTTT TCGACAAGAT CAGCATCGCG GTACTGCTTT CCAGCCCTGC CTTCCAAGGC
GCATTGGGTG TAGAAGGCGA AACCACCAAG CTGGGATTCT TGATGACGGG GTTTCTCCTG
TCCTACGGCT TCTCTTCGAT GTTCCTTGGG TTCTTGAGCG ACATCATCAG CCCCAAGAAA
TGTCTCTACG CCATGCTGGT GTTGACGGCC GTCATCATGA CGATCATGGG GTTCGCGGAA
TCCTACGCTC AGATGTTGAC GTTGCGCATC GTGCTGGGCA TTGCCGAGGG GCCGATGTTC
GCCATCGCCT ATACCATCGT CAAGCGTACC TTCGCACCCC GCGAACAGGC GCGTGCCACC
ATGCTCTGGC TTTTAGGCAC ACCCATCGGT GCGTCGATCG GATTCCCCTT CACGATCTAT
ATTCTGGATA ACTTTGGATG GCGGGCGACC TTCTTCGCCA TGGCCTTGCT GACGATTCCT
CTTCTCATGC TGGTGGTATG GGTGTTTCGC AAGGTCGATG TATCGACGAA ATCGCCCTTG
CAAAAGGCGC GCGGCGACAA GCATGTGCCC CTGTCCAGCC ATCGCCAGTC GACTCGGGAA
TTGTTCGGAA ATCTCTCCTT CTGGGCGATC TGTGTCTTCA ATATCGCGTT CCTGGCCTAT
CTCTGGGGAC TCAATAGCTG GCTGCCGACC TACCTGGTCG AGGACAAGGG CATCGACCTC
GATTCGGCGG GCATTTTTTC CTCCCTTCCG TTCATCGCGA TGCTGGTCGG CGAGGTCGTG
GGAGCGTTCG TCTCCGATCG GTTCGACAAG CGGGCGTTGA TGTGTGCGTT CTCGTTGTTC
GGCGCCGGTC TCGGCCTGCT GCTCGTGCTG TCCATCGACG CCTCGAATAT CGCCATGATC
TCCGCCATGT CGTTCAGCGC CATGTGCTGG GGCGTGGGCG CCCCCAACAT CTTCGCCCTG
CTGGCCAAGG CGACACCTGC CAAGGTCAGT GCCACGGCCG GCGGCATTCT CAATGGATTC
GGCAATTTTT CCGGTGCTTT GACGCCGGTC ATCATCGGCG CCCTGATTGC CTCGACCGGC
AATATGACGG CCGGACTCTT GTTCATGGCG GTGTTGGCCT TTGCCGGCGG TGCTGTGCTG
TTGCCCCTGG TGCGGCGTTA CTGA
 
Protein sequence
MTTTTASSQS SAPAAETPDD VQHAGIKRWA IPFALLCCVI LSFFDKISIA VLLSSPAFQG 
ALGVEGETTK LGFLMTGFLL SYGFSSMFLG FLSDIISPKK CLYAMLVLTA VIMTIMGFAE
SYAQMLTLRI VLGIAEGPMF AIAYTIVKRT FAPREQARAT MLWLLGTPIG ASIGFPFTIY
ILDNFGWRAT FFAMALLTIP LLMLVVWVFR KVDVSTKSPL QKARGDKHVP LSSHRQSTRE
LFGNLSFWAI CVFNIAFLAY LWGLNSWLPT YLVEDKGIDL DSAGIFSSLP FIAMLVGEVV
GAFVSDRFDK RALMCAFSLF GAGLGLLLVL SIDASNIAMI SAMSFSAMCW GVGAPNIFAL
LAKATPAKVS ATAGGILNGF GNFSGALTPV IIGALIASTG NMTAGLLFMA VLAFAGGAVL
LPLVRRY
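
The relationship between the gene and protein sequences above can be checked by stripping the 10-character blocking, translating with table 11, and computing the GC fraction. The following is a minimal sketch, not the database's own method. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard amino-acid string for NCBI translation table 11 in TCAG codon order. Alternative start codons are not handled, which is acceptable here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and first two blocks of the protein sequence.
FIRST_GENE_LINE = (
    "ATGACGACAA CGACTGCATC TTCCCAATCA TCGGCACCGG CAGCCGAGAC ACCTGACGAC"
)
FIRST_PROTEIN_BLOCKS = "MTTTTASSQS SAPAAETPDD"

if __name__ == "__main__":
    dna = clean(FIRST_GENE_LINE)
    print(translate(dna) == clean(FIRST_PROTEIN_BLOCKS))
    print(f"GC fraction of first line: {gc_fraction(dna):.3f}")
```

Passing the full gene sequence through `clean` and `translate` in the same way allows the result to be compared against the complete protein sequence listed above.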