Gene Cag_0971 details


Gene Information

Locus tag: Cag_0971
Symbol:
ID: 3746896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1307812
End bp: 1310922
Gene Length: 3111 bp
Protein Length: 1036 aa
Translation table: 11
GC content: 48%
IMG OID: 637773501
Product: NolG efflux transporter
Protein accession: YP_379277
Protein GI: 78188939
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00914] heavy metal efflux pump (cobalt-zinc-cadmium)
            [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
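
The coordinates and lengths in this record agree with each other, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1307812, 1310922)
print(length, protein_length(length))  # prints: 3111 1036
```

Both values match the Gene Length and Protein Length fields reported for Cag_0971.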


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCTTA CTGAGCTTTC CATAAAACGC CCCACGCTGA TTGTGGTATT TTTTACCGTA 
CTTGCAATCC TTGGCTTGTT TAGTTATCGG CAGTTGCAGT ATGAGTTACT GCCAAAAATG
ACGCCGCCCG TTGTCTCCAT TTCAGTAGTG TATCCGGGTG CCTCTCCCTC TGAAGTTGAA
ACATCTCTTA CCAAACCGCT TGAAGAGGCG GTTTCGGCGC TTGAGAAAAT CTCCTCTATT
TCCTCTACCT CAACCGAAGG GTTATCGGTA GTAACCATTG AGTTCGATAA CAGCGCCGAT
ATTGAGCAAT CGTTGCAAGA TGCTCAGCGC AAAATTAACG AGGTAAGTGA TTTGTTGCCG
ACGGAGGCAA AAACGCCCGT TATTAGCCGT TTTGCTCTTG ATGAGGTGCC CGTTTTGCGC
ATTGGCGCTA CTTCCTCTCT GCCCGATAGC GAGTTCTACC AAATGTTAAA AGATGAGGTA
AAGCCGCAAC TCTCCTCTGT TGCGGGCGTG GGACAGGTTT ATCTTGTTGG CGGCAGAGAG
CGCGAAATTA GGGTGAACCT TGATCTTGAT CGTATGCAAG CCTATGGTTT AACCGTAACT
GATGTGGTGC GCGAGGTAGA AAAAGCCAAT AGCGATTTTC CAACGGGTGC TATTGAGGAG
ACGGAGCGCC AATTTGTGGT ACGCCTTGCT GGTAAATTCC GCTCACTTGA AGAGCTGCAA
GCGCTCATTA TTCAAGCAAC GCCCGAAGGT AACGTGCAAT TGCGTGATAT TGCTACGGTT
GAAGATGGCT TTCGGGAAAT AACCACGTTG TCGCGTTTAA ACGGGCGCGA AAGCGTTGGC
ATTCTTATTA TGAAGCAAAG CGATGCCAAT ACGGTTGATG TGAGCCGTTT AGTGCGTGCC
TCGCTTTCAA AAATTGAGCA ACTCTACAGC AACCGCAATC TTCGCTTTTC CGTAGCGCAA
GATGCCTCCA CCTTTACGCT TGATGCTGTT ACAGCCGTGC AGCACGACTT GCTTTTAGCC
GTACTGCTTG TAGCGTTTGT TATGATGCTC TTTTTGCACA GCTTGCGTAA CTCGCTCATT
GTGCTGGTTT CCATTCCCAC CTCAATGGTA ACCACCTTTA TTGGCATGTA CCTCTTCGAC
TTTTCGCTTA ACCTGATGAC GCTGCTCTCG CTTTCGCTTG CTGTTGGCGT TTTGGTTGAC
GATTCCATTG TGGTGTTGGA AAATATTTAT CGCCACCTTG AAAAAGGAGA GGAGCCACGC
CATGCTGCTA TCGTTGGACG TAATGAAATC GGTTTTACAG CCCTCTCAAT TACGTTGGTT
GATGTGGTGG TTTTTTTACC ACTTTCGTTA GTAAGTGGTA TTGTGGGCAA CATTTTACGC
GAATTTGCGG TGGTGATGGT TATTTCCACC ATGGTGAGCT TGTTGGTCTC CTTTACCTTA
ACCCCACTCT TAGCATCGCG CTTTTCACGC CTTAGTGCGT TTACGGGTAA AAATGTTGCA
GAACGTTTTG CCTTAGGTTT TGAAGAGCGC TTTCACGCCA TGCGTGAGTT CTATTTGCGC
TTACTTGGGT GGAGCTTGCG CAATCCCATT AAAGTTTTTT TAAGTGCTAT GGCACTGCTG
GTTGCCTCAT TCTCATTACT TGTAATGGGA GTGATTGGTG GAGAATTTAT TTCGGTATCG
GATAGAGGAG AATTTGCCGT CAAGCTTGAG CTTGAAGCAG GCACCTCGCT TGCCGAAACC
AATCGAATTA CGCGGCAGGT TGAGCATCGT TTAGAATCGT TACCTGAAGT ACGCGATGTT
ATGGTTACGG TTGGGGCTTC AAGCGAAGGC TTTTTTAACC AAAGCTCCGA CAATGTTGCT
GAACTCAACG TGCGCCTTTC ACCAAAAGAG GAGCGTTCCC GCTCAACCAA CGAGCTAATG
GCGCGAGTGC GTGCCGATTT GCAGCACCTG CCCGGTGTTA CCGCAACCAT TAACCCCATT
GGTATTTTTG GTACGGCAAA CGAAACTCCC GTAGCGGTAA TTTTTAGCGG ACCCGACCGC
AACGAAGTAG CGCGTGTTGC TGAAGCACTT AAAGCACGTT TAAGTACGGT GCCCGGTACG
GCTGACGTGG TGCTTACCTC AAAGCCCGGT AGCCCTGAAT TGCGCGTTGT TATTGACCGC
GAACAAATGG CAGCTTTTGG GCTTTCCATT GCTGATGTGG GCGCAGGCTT ACGAGTAGCT
TATGCGGGCG ATGAAGCAGG TAAATACCGT GATGGCGACG ACGAGCACAT TATTCGAGTG
ATGCTTGATG CTGGTTACCG CAACGATGTT GCCATGCTCT CCTCCATGAT TTTTCGTACA
CCTGAAGGCG ACATGGTGCC ACTTGGGCAA TTTGCCACGT TTGAGCAAGA ACGCGGCTAT
ACGCAATTAC AACGCAAAAA CCGCAACAAC GCCGTTTGGG TAAAAGCCCA AGTGATTGAC
CGTCCCGTTG GCGATATTGG GCAAGAGATT GAACGCATTA TTGCCGACAT GAAGCGCAAC
GGTACCATGC CCTCAACCGT TAGCTACGCT TACGAATCCG ACTTAAAGCA ACAACGCGAA
TCCAATTCCA CGCTTGGACT TTCGTTCATG GTTGCTATTG CCTTTGTTTA TCTCATTATG
GTTGCCCTTT ATGACTCTTG GTTTTGGCCA ATGGTGGTGA TGTTTTCCAT TCCGCTTGCC
ATTATTGGAG CACTCTTCTC GCTTGCTTTA ACCGGTAAAT CGCTCAGCAT GTTTACCCTT
CTTGGTATTA TTATGCTGAT TGGGCTTGTG GGCAAAAACG CCATTCTGTT AGTGGACTTT
ATTAACAACG CCCGTGCTGA AGGAATGGCG CTTACTACCG CTATTACCGA AGCCGCAAAA
GAGCGCCTTC GCCCAATTTT AATGACCACC TTTACCTTAA TTTTTGGTTT GTTGCCAATT
GCTCTTTCAG GCAGCTCAGG TTCAGAATGG AAGTCAGGAC TTGCTGTTGC GCTGGTTGGT
GGACTTTTTA GCTCAATGTT TTTAACCCTG CTGGTTATTC CCGTTGTTTA TGTTTGGTTC
GATCGCTTAC ACGAGCGAGT CTTAGCATTA CGTCAACGGA TTATGGGGTA A
 
Protein sequence
MTLTELSIKR PTLIVVFFTV LAILGLFSYR QLQYELLPKM TPPVVSISVV YPGASPSEVE 
TSLTKPLEEA VSALEKISSI SSTSTEGLSV VTIEFDNSAD IEQSLQDAQR KINEVSDLLP
TEAKTPVISR FALDEVPVLR IGATSSLPDS EFYQMLKDEV KPQLSSVAGV GQVYLVGGRE
REIRVNLDLD RMQAYGLTVT DVVREVEKAN SDFPTGAIEE TERQFVVRLA GKFRSLEELQ
ALIIQATPEG NVQLRDIATV EDGFREITTL SRLNGRESVG ILIMKQSDAN TVDVSRLVRA
SLSKIEQLYS NRNLRFSVAQ DASTFTLDAV TAVQHDLLLA VLLVAFVMML FLHSLRNSLI
VLVSIPTSMV TTFIGMYLFD FSLNLMTLLS LSLAVGVLVD DSIVVLENIY RHLEKGEEPR
HAAIVGRNEI GFTALSITLV DVVVFLPLSL VSGIVGNILR EFAVVMVIST MVSLLVSFTL
TPLLASRFSR LSAFTGKNVA ERFALGFEER FHAMREFYLR LLGWSLRNPI KVFLSAMALL
VASFSLLVMG VIGGEFISVS DRGEFAVKLE LEAGTSLAET NRITRQVEHR LESLPEVRDV
MVTVGASSEG FFNQSSDNVA ELNVRLSPKE ERSRSTNELM ARVRADLQHL PGVTATINPI
GIFGTANETP VAVIFSGPDR NEVARVAEAL KARLSTVPGT ADVVLTSKPG SPELRVVIDR
EQMAAFGLSI ADVGAGLRVA YAGDEAGKYR DGDDEHIIRV MLDAGYRNDV AMLSSMIFRT
PEGDMVPLGQ FATFEQERGY TQLQRKNRNN AVWVKAQVID RPVGDIGQEI ERIIADMKRN
GTMPSTVSYA YESDLKQQRE SNSTLGLSFM VAIAFVYLIM VALYDSWFWP MVVMFSIPLA
IIGALFSLAL TGKSLSMFTL LGIIMLIGLV GKNAILLVDF INNARAEGMA LTTAITEAAK
ERLRPILMTT FTLIFGLLPI ALSGSSGSEW KSGLAVALVG GLFSSMFLTL LVIPVVYVWF
DRLHERVLAL RQRIMG
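
The sequences above are laid out in space-separated blocks of 10 characters, 60 per line. A minimal sketch for collapsing that layout into a plain string and computing a GC fraction follows. The helper names are hypothetical, and only the first line of the gene sequence is used for brevity, so the result is not expected to equal the whole-gene GC content of 48% given in the record.

```python
def collapse_blocks(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACGCTTA CTGAGCTTTC CATAAAACGC CCCACGCTGA TTGTGGTATT TTTTACCGTA"
seq = collapse_blocks(first_line)
print(len(seq), seq[:3], round(gc_fraction(seq), 3))
```

Passing the full gene sequence through the same two helpers should give a value close to the reported GC content, though that is not verified here.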