Gene Clim_0994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0994 
Symbol 
ID6355443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1085036 
End bp1086310 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content52% 
IMG OID642668618 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_001943049 
Protein GI189346520 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAT CGCCACTGGT TATTCTCCTT CTTACCGTTA TGCTCGATCT GATCGGGTTC 
GGCATTGTGC TGCCGCTGCT GCCGACCTAC GCAAAAGATC TCGGCGCCAG TCCCTTCATG
ATCGGTCTCA TTGCCGCGAT ATTCTCCATC ATGCAGTTCA TCTTCTCCCC GCTCTGGGGC
AAGCTGAGCG ACAAGATCGG ACGCAGGCCG GTTATGCTTA TCAGCATATT CGTTACGGCA
CTCTCCTATC TGATTTTTTC CCAGTCCGAT ACCATTCTGC TTCTGATTTT CGCAAGAGGA
CTTTCAGGCA TCGGCTCGGC CAATATCGCC GCCGCGCAGG CATACATTAC CGATGTGACC
GACAGCAAAA GCCGCTCGGG GGCCATGGGC ATGATCGGCG CGGCGTTCGG CGTAGGATTC
ATTATCGGCC CCCTTGTCGG CGGCCTGCTC AAGCATAACT ATGGCATAGA AATGGTCGGG
TACGTTGCTT CGGCCCTTAT ATTCTTCGAT TTCATTCTTG CCATATTCTT TCTGCCGGAA
TCCAACAAAC AGGCAAAAAA GTTCAATCTC GGCATGTTCA GCGGCAAAAG CGCGGCCAGT
GGCAGTGCAA ACGGCAGCTC GTCCTCGTTT CTGAAAGAGA AGATTACGGA ATATGTCGAT
GGCCTGAAGC TGACGTTCAG TTCGAAGCCG CTCGCTCTGC TGATGATCGC AAACTACATA
TTCACCTTCG CCATCGTCAA CATGCAGGTT GCTTCTATCC TGCTCTGGAA GGAGTATTTC
GGTGCTTCCG ATCAGGAGAT CGGCTATATT TTTGCCTATG TCGGCTTTTT CTCGGTTATT
GTGCAGGGGG GGCTGATCAG GCAGCTCATC AAGAAACTCG GAGAACACAA ACTGTTTCTC
TGGGGCCATA TCTTTACCTT TGCCGGCGTC TTTTTTGTCC CGTTCATTCC TCAGGATACG
TTGTTCTCGT ACGGACTGCT CATACTGCTC TGCTTTGCGA TCGGCACGAG TCTCGTTGCG
CCCATCAACC TCTCCATGAT CTCTCTCTAC AGCTACAAAC AGAAGCAGGG GCAGATTCTC
GGTCTTTCGC AGTCGGTCAA CTCGTTTGCA CGCATCATGG GGCCGTTCAG CGGCAGTATT
CTTTACGGCA TGAATTTTCA TGCCCCCTAC ATCGTAGCCG GTCTGCTTAC CATCATTGGC
ACGTTTATAG CGTTCGCCCT GTTCAAGTAT AAAATCGAAG CCCTCGAACC CGTTGCGGAA
GCCGAAAGCA TCTGA
 
Protein sequence
MKKSPLVILL LTVMLDLIGF GIVLPLLPTY AKDLGASPFM IGLIAAIFSI MQFIFSPLWG 
KLSDKIGRRP VMLISIFVTA LSYLIFSQSD TILLLIFARG LSGIGSANIA AAQAYITDVT
DSKSRSGAMG MIGAAFGVGF IIGPLVGGLL KHNYGIEMVG YVASALIFFD FILAIFFLPE
SNKQAKKFNL GMFSGKSAAS GSANGSSSSF LKEKITEYVD GLKLTFSSKP LALLMIANYI
FTFAIVNMQV ASILLWKEYF GASDQEIGYI FAYVGFFSVI VQGGLIRQLI KKLGEHKLFL
WGHIFTFAGV FFVPFIPQDT LFSYGLLILL CFAIGTSLVA PINLSMISLY SYKQKQGQIL
GLSQSVNSFA RIMGPFSGSI LYGMNFHAPY IVAGLLTIIG TFIAFALFKY KIEALEPVAE
AESI