Gene Cag_0039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0039 
Symbol 
ID3747238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp42581 
End bp44194 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content50% 
IMG OID637772565 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_378361 
Protein GI78188023 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily
[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.677272 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAACG CACCTTCTCT TTCAGCCTCG CCAAAGCTAC TCGGCACCAC CGAAGAGCAT 
TACGAAACAG GGTGGCGCAA GCTCATTATC ACCTTGACGG TTATTGTTTC CGCCATGCTG
GAGCTGATTG ACACCACCAT TGTTAATGTG GCAATTACGC AAATTAGCGG CAACCTTGGA
GCCAGCATTG AGGACACCGC ATGGGTGGTA ACAAGCTACG CCATTGCAAA CGTTATTGTA
ATTCCACTCT CAGGCTTTCT TGGCAATTTG CTTGGGCGGC GGAACTACTA CATTGGCTCC
ATTCTGCTCT TTACCGTCGC CTCTCTCCTG TGCGGCGTTG CAACCGACAT TTGGACACTT
GTCTTTTTCC GCTTTGTGCA AGGCATTGGC GGTGGCGCAC TGCTCCCCAC CTCGCAAGCC
ATTTTGTACG AAACCTTTCG CCCCGAAGAG CGCGGCAAAG CCACCGGTAT TTTTTCAATG
GGCTTAGTGC TTGGACCAAC CATTGGACCA CTTTTAGGCG GCTACTTAGT AGATTATTTC
AATTGGGAAT GGTGCTTTTT TGTCAACATT CCTATTGGAC TGTTAGCTGC TTGGTCATCC
TTTATTTTTC TTAAAGAGCC AAAAGTTACC CACACTGTCT CAAAAATTGA TTGGGCTGGA
ATTGGCTTAC TTGCCGTTGG CATTGGTTCC CTGCAATTCA TTTTAGAGCG AGGAGAATCC
AAAGATTGGT TTGAAACCCC CTACATTACA TGGTTTACCA TTATTGCCGT ACTTTCGCTG
ATTGCGTTTG TATGGCACGA ACTTCACACT AAGGAGCCTG CCGTTGACCT TCGTGTGCTG
GCACGAAGCC ACAATTTGCC CATTGCTGCC GTGCTCACCT TTATTGTCGG CTTTGGTTTG
TACGGTTCAC TCTTTGTTTT TCCCGTTTTT GTGCAAGGGC TGCTTGGTTT TACCGCTGTG
CTCACCGGTT TAGTGCTCTT CCCCAGCGCT ATGGTTACCG GTATGATTTC CATGCCACTT
GGCATGGCGC TGCAAAAAGG TGCCTCACCA AAGCATTTAA TGCTCTTTGG AATGCTCACC
TTTTCACTTT TTTGCTGGCT ACTGGGGCAA CAAACCTTGC AATCAGGCGC CGAAAACTTT
TTTTGGATAT TGCTGCTTCG CGGCATTGCA CTCGGCTTTA TTTTCATTCC CGTTACCATG
CTCGCAATTT CGGGATTGCA TGGCAAAGAT ATTGGACAGG CAACTGGCTT AAACAACATG
GTGCGCCAAC TTGGCGGCTC ATTCGGCATT GCTATTGCCA ACACCTACAT CGCCAAACGA
GTAGCCGCAC ACCGCACCGA GCTACTAAGC CATCTTTCGC CTTACGACCC CGAAGCAATG
AACCGCATAC ACGCCATTGC CGCCAAAGCC ACTGCTGAAC ACGGGCTGCC ACCCGCAAGC
GCCGAACTTG CCGCCCTGAA AGCGCTTGAA GGTACGGTAA CCGTGCAAAG CACGCATCTT
GCCTTTATGG ATGCCTTTAT GCTGATTGCT CTTCTTTTTC TCTGCGCTGT GCCACTACTC
TTTTTTATTC GGCTGCATAA GGGGGAACAG GCAAGTGCAA TGGGGGGGCA TTGA
 
Protein sequence
MANAPSLSAS PKLLGTTEEH YETGWRKLII TLTVIVSAML ELIDTTIVNV AITQISGNLG 
ASIEDTAWVV TSYAIANVIV IPLSGFLGNL LGRRNYYIGS ILLFTVASLL CGVATDIWTL
VFFRFVQGIG GGALLPTSQA ILYETFRPEE RGKATGIFSM GLVLGPTIGP LLGGYLVDYF
NWEWCFFVNI PIGLLAAWSS FIFLKEPKVT HTVSKIDWAG IGLLAVGIGS LQFILERGES
KDWFETPYIT WFTIIAVLSL IAFVWHELHT KEPAVDLRVL ARSHNLPIAA VLTFIVGFGL
YGSLFVFPVF VQGLLGFTAV LTGLVLFPSA MVTGMISMPL GMALQKGASP KHLMLFGMLT
FSLFCWLLGQ QTLQSGAENF FWILLLRGIA LGFIFIPVTM LAISGLHGKD IGQATGLNNM
VRQLGGSFGI AIANTYIAKR VAAHRTELLS HLSPYDPEAM NRIHAIAAKA TAEHGLPPAS
AELAALKALE GTVTVQSTHL AFMDAFMLIA LLFLCAVPLL FFIRLHKGEQ ASAMGGH