Gene Clim_1686 details

Gene Information

Locus tag: Clim_1686
Symbol:
ID: 6353993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1853679
End bp: 1855043
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 54%
IMG OID: 642669291
Product: major facilitator superfamily MFS_1
Protein accession: YP_001943707
Protein GI: 189347178
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
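
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it matches the listed 1365 bp) and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1853679, 1855043)
print(length, protein_length_aa(length))  # prints "1365 454"
```

Both values agree with the Gene Length and Protein Length fields of this record.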


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.000674894
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGCAT GGACAGCTCC GAACTATTTT CATGAATCGC AAGACTCCAT GCTCCCAATG 
TCCTCTCAAG AACAACGTCG TCTCGGCCCG GTACAATTAT CACCGAACAT CCTGCCCCGC
CACGGACTGA CATTTCTCTT TTCGGCGTTT TTTTCCATAG GCCTTGTCAC CTTCGTTTCA
ATCGGACAGG CCTATATTCT CAATGAAAAC CTGAAAATAC CCGTATCGGA ACAGGGCACA
ATAAGCGGCA ACCTTGTTTT CTGGACGGAA ATCGTCACCC TTCTCTTTTT TGTTCCAGCC
GGCGTCCTGA TGGACCGGAT CGGACGCAAA CAGGTTTTTA GCGCCGGCTT CCTCCTGCTT
GCCCTTGCCT ACGCACTCTA CCCTACTGCG TCTTCGATCG GAGAACTCAC CCTTTTCCGT
ATAATTTATG CGCTTGGAAT CGTTGCGGTT ACCGGAGCCC TCTCCACCAT CATGGTTGAC
TATCCTGCCG AGCGGTCGCG CGGCAAGCTT ATTGCCATTA CCGGGTTTCT CAATGGCCTC
GGTATCGTCG TCCTTAACCA GTTCTTCGGA GCACTGCCCC AGAGGCTGAC CGCCGGAGGC
ATGAGCGGAA CTGATGCCGG ACTCTATACC CATTTCGGAA TAGCCGGCAT CGCACTGATC
TCGGCAGTTG TGGTCAGTGT CGGTCTGAAA AGCGGAACCC CGGTAAAAAA AGAGGAGCGC
CCGCCCCTGA AAAAACTGCT CACCAGCGGT ATAAGCTATA TGAAAAACCC GAGAATCCTG
CTCTCATACG GTGCGGCATT TGTCGCACGC GGAGACCAGT CGATCATCGG TACCTTCCTG
CCGCTATGGG GAACAACTGC CGGCATCGCC ATGGGCATGG ATCCTGCTGA AGCCGTTAAA
AAAGGCACGC TCATCTTTAT CATCTCCCAG GCAGCGGCAC TGCTCTGGGC TCCCGTAATC
GGCCCGGTTA TCGACAGGAT GAACCGGGTC AGCGCTCTTG TGCTCTGCAT GTTTCTTGCC
AGTGCAGGAT ACCTGTCACT TGGTTTCGTC GGCAACCCTC ATGAACCGTT CTCTCTCTTT
TTCTTCATGC TTCTCGGAAT CGGCCAGATC AGCTCTTTCC TCGGAGCACA ATCGCTAATC
GGCCAGGAGG CTCCTAAAGC CGAGCGGGGA TCGGTAATCG GCATGTTCAA CATCAGCGGA
GCAATCGGCA TTCTTATCAT CACCTCTACA GGAGGCCGGC TTTTTGACGG AATGAGCCCG
AAAGCTCCCT TTCTTGTTGT CGGCGCTATA AACCTGCTGG TCATGCTTGC CGGAATTCTT
GTACGCATCC ATGCGCCGGG AAAAGCTGCC GGGGGTGAAG AATAA
 
Protein sequence
MNAWTAPNYF HESQDSMLPM SSQEQRRLGP VQLSPNILPR HGLTFLFSAF FSIGLVTFVS 
IGQAYILNEN LKIPVSEQGT ISGNLVFWTE IVTLLFFVPA GVLMDRIGRK QVFSAGFLLL
ALAYALYPTA SSIGELTLFR IIYALGIVAV TGALSTIMVD YPAERSRGKL IAITGFLNGL
GIVVLNQFFG ALPQRLTAGG MSGTDAGLYT HFGIAGIALI SAVVVSVGLK SGTPVKKEER
PPLKKLLTSG ISYMKNPRIL LSYGAAFVAR GDQSIIGTFL PLWGTTAGIA MGMDPAEAVK
KGTLIFIISQ AAALLWAPVI GPVIDRMNRV SALVLCMFLA SAGYLSLGFV GNPHEPFSLF
FFMLLGIGQI SSFLGAQSLI GQEAPKAERG SVIGMFNISG AIGILIITST GGRLFDGMSP
KAPFLVVGAI NLLVMLAGIL VRIHAPGKAA GGEE
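
The sequences above are printed in space-separated blocks of 10, so they need cleaning before any computation. The following is a minimal sketch, with hypothetical helper names, of how one might strip the block formatting and compute a GC percentage comparable to the GC content field. Only the first and last rows of the gene sequence are used as a demo; the full sequence would have to be pasted in to reproduce the reported value.

```python
def clean_sequence(raw: str) -> str:
    """Drop spaces and line breaks from a sequence printed in blocks of 10."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last rows of the gene sequence, copied from the record as a demo.
first_row = "ATGAATGCAT GGACAGCTCC GAACTATTTT CATGAATCGC AAGACTCCAT GCTCCCAATG"
last_row = "GTACGCATCC ATGCGCCGGG AAAAGCTGCC GGGGGTGAAG AATAA"

print(len(clean_sequence(first_row)))               # bases in one full row
print(clean_sequence(first_row).startswith("ATG"))  # start codon check
print(clean_sequence(last_row).endswith("TAA"))     # stop codon check
```

Applied to the complete gene sequence, `len(clean_sequence(...))` should match the listed gene length if the record is internally consistent, and `gc_percent` can be compared against the GC content field.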