Gene Cag_1203 details


Gene Information

Locus tag: Cag_1203
Symbol:
ID: 3748237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1597682
End bp: 1600849
Gene Length: 3168 bp
Protein Length: 1055 aa
Translation table: 11
GC content: 47%
IMG OID: 637773737
Product: hydrophobe/amphiphile efflux-1 HAE1
Protein accession: YP_379508
Protein GI: 78189170
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
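
The start, end, gene length and protein length fields above agree with one another, and the check is simple arithmetic. The Python sketch below is illustrative only: the function names are hypothetical (not part of any database API), and it assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1597682, 1600849)
print(length)                  # 3168
print(protein_length(length))  # 1055
```

Both results match the listed values of 3168 bp and 1055 aa.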


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0458875
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGAGC GTTTTATTTC CCGCCCTGTA CTTGCCACCG TTATCTCCAT TTTGTTAGTG 
ATTCTTGGTG TGGTAGGTCT TTCGCAATTG CCCGTTACCC GCTTCCCCGA TATTGCGCCG
CCAAGCGTTT CAGTGTCGGC AACCTACCCG GGGGCAAGTG CTGAAACGGT GGCTCGTAGT
GTGGCTCCAC CGCTTGAGGA GGTTATTAAC GGTGTTGAAA ACATGACCTA CATGACTTCA
ACCTCTTCCA ATGACGGATC GCTAAACATT TCCGTTTTTT TTAAGCAAGG CACTAACCCC
GATCAAGCTG CGGTTAATGT GCAAAATCGA GTGTCGCAAG CTACAAGCCG TTTGCCAGCA
GAAGTAAATC AAATTGGTGT TAGTACGGTG AAGCGGCAGA ATAGCCAGAT TATGCTCATC
AACTTAGCCA GCACCAATCC TGAGTATGAT GTGGTCTTTT TGCAGAATTA TGCAAAAATC
AATCTTGTTG ACGATTTATC GCGTGTTCCG GGTGTTGGGC AGGTTTCGGT ATATGGCAAC
CTCGATTACT CAATGCGCAT TTGGCTAAAG CCGCAAGTAA TGGCAGCTTA TAGCATAACG
CCACAAGAGG TGCTATCGGC AATTCAAAGC CAAAACTTTG AAGCGGCGCC CGGCTCATTT
GGTGAAAACA GCAATGAAGC CATGCAGTAC GTGATGCGCT ACAAAGGCAA GAATCGCTAT
CCGGTGGAAT ATGAGCAAAT GGTGATTCGT GCGGGAGAGG ATGGCACTTT ACTGCGTTTG
GGTGATGTTG CACGAGTGGA GTTTGGGGCG TACCGTTACG GCGTAAACAC CAAAGCAAAT
GGTTTACCAG CAGTTGTGTT GGCTATTTTT CAAGCGCCCG GTTCCAACGC TAATGCCGTT
GAAAGTTCCC TCCAAAAAGT GCTGCAAAAA GTATCTCCCA CCTTTCCCAA AGGCATTACT
TACAGTATTC CATATAGTTC AAAAAAAGTT GTTGACGAGT CCATCACTCA AGTACTTCAT
ACCTTAATAG AAGCCTTTTT GCTGGTCTTT CTCGTGGTCT TCATTTTTTT GCAAGATTTA
CGCTCCACCT TAATTCCTGC AATTGCGGTG CCTGTGGCGC TTGTGGGCAC CCTCTTTTTT
ATGAAGCTCT TTGGATTTTC CATTAACGTG CTGACGCTTT TTGGCTTAGT GCTCGCCATT
GGCATTGTGG TGGACGATGC CATTGTGGTG GTTGAAGCGG TTCACGCTAA AATGGAGCAA
AAGGGCTATG GCGCTAAAGT GGCAACGGTT TCGGCAATGC GCGATATCAC TAAAGCTATT
GTAACCATCA CGTTGGTGAT GTCCTCCGTT TTTTTGCCTG CGGGATTTTT AGAGGGTTCA
ACAGGCGTGT TTTATCGCCA ATTTGCTTTT ACGCTTGCTA TTGCCATTCT GCTTTCAGCG
CTGAATGCCT TAACGCTTAG CCCAGCACTT TGCGCCCTTT TTCTTGAAAA AGAGCATCGC
CACTCATCCA TTGGCAAGCG CTTTTTTACC TCCTTTAACA CGGCGTTTGA GGCAATGAAG
CGCAAGTACC TTGGCGCCTT GCTCTACCTT CTTCGCCATA AAAAAGTTGC ATACAGTGGT
TTGGCGCTGC TTACAGCACT TTCGTTTTGG ATGTTTAAAG CTACTCCCAC CGCATTTATT
CCCGATGAGG ATAATGGCTT TGTAATTGTG TCGGCAACCA TGCCTCCGGG CGCTTCGTTT
GCGCGTACCA AAGTGGTGAT GGATGACGCT GCTCGCACAT TGCAAGCTAT GCCAACGGTA
AAAAAGGTGA TTGAGGTTGC AGGCATTAAC ATTTTAACCC GTACTTCGTC GCCCTCTTCA
GGCTTGCTCT TTGTGCAGTT GCAAGATCAC AACGTGCGTG GCAAGCAGGG TGATATTAAG
AATGTGATTG GGGCGATGTC GAAAAAATTA GCCAATCGCG AAGCCTCATT TTTTGTGTTG
GCACAACCAA CGGTGCCGGG TTTTAGCACC GTTGGTGGAC TTGAATTAGT GCTGCAAGAT
CGCAACGGAG TGGAACTTAG CCGCTTTAAT GAGATTGCTC AAAACTTTCT GAATGGCTTG
CGGCAGCATC CCGCTATTGG GGTGGCATTT ACCAACTTTA AGGCAAATAA TCCTCAATAT
GAGCTTGAAG TTGACCCCAT GCAGGCAAGC CAGCTTGGCG TAAGTACGCG CGATGTTATG
AGCGTGCTGC AAGCTTACTA CGGTAGTGTG CAAGTCTCCG ATTTTAACCG CTTTGGCAAA
TATTATCGTG TTATTATGCA AGCGGAACCA AGCGAACGCA CCGATGAATC CTCAATCAAT
AGCATGTTTA TTCGCAATGT GCGTGGCGAT ATGGTGCCAC TCTCTTCGGT TGTGCGCTTA
AAGCGAGTTT ATGGCGTTGA AGCCGTTGAC CACTTCAACC TCTTTAATGC TATTTCGGTT
AATGCCGTTG CCAAACCCGG CTTTAGCACA GGGCAGGCAA TTCAAGCTGT TGACGATGTG
GCACGTAAAA CGCTGCCCAC AGGTTTTACC TATGATTGGA AAGGGCAAAG CCGCGAAGAA
ATTTCAGCCA GCGGAGGTTT ACTCCTCATT TTCTTGCTCT CCATTATCTT TGTCTATTTT
TTGCTTGCGG CATTGTACGA AAGCTATCTT TTACCACTTG CGGTTATGCT CTCCATTCCA
ACAGGATTGT TGGGAGTTTT TATTGGCATC AAATTAGCAG GCATTGCAAA CAACATTTAT
GTGCAAGTTG CCATTATCAT GCTTATTGGT TTGCTGGCAA AAAATGCCAT TCTTATTGTG
GAATTTGCCC TTCAGCGCCG CATTGCAGGG CGCCCACTTG CGGTTGCAGC CATTGAGGGC
GCTCGTGCTC GCCTTCGCCC CATTCTTATG ACCTCGTTTG CTTTTATGGC TGGCTTGCTG
CCACTGCTTT TTGTGTCGGG TCCAGCCGCT CAAGGTAACC ACTCCATTGG CGCTGCGGCG
CTTGGCGGTA TGTTTGCGGG TTTAGTGTTT GGTATTCTTG TGGTGCCGCT GCTTTTTGTA
ACGTTCCAAT ATCTGCAAGA GCGTATTACC GGTGTTGCAA AGCCAATTGT TGAAGCGGGG
GATTTGTTGG ATGCTGTGGT GATGAAAGAG AAAGGAACTC ATGTGTAA
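
The gene sequence above is laid out in space-separated blocks of ten bases, so the whitespace has to be removed before computing anything from it. The Python sketch below strips the whitespace and computes a GC percentage; the helper names are hypothetical, and only the first row of the sequence is used as sample input, so the printed GC value is not the whole-gene figure listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGTTTGAGC GTTTTATTTC CCGCCCTGTA CTTGCCACCG TTATCTCCAT TTTGTTAGTG"
seq = clean_sequence(first_row)
print(len(seq))          # 60
print(gc_percent(seq))   # GC percentage of this row only
```

Passing the full block of sequence lines to `clean_sequence` should give a string of 3168 bases, matching the listed gene length.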
 
Protein sequence
MFERFISRPV LATVISILLV ILGVVGLSQL PVTRFPDIAP PSVSVSATYP GASAETVARS 
VAPPLEEVIN GVENMTYMTS TSSNDGSLNI SVFFKQGTNP DQAAVNVQNR VSQATSRLPA
EVNQIGVSTV KRQNSQIMLI NLASTNPEYD VVFLQNYAKI NLVDDLSRVP GVGQVSVYGN
LDYSMRIWLK PQVMAAYSIT PQEVLSAIQS QNFEAAPGSF GENSNEAMQY VMRYKGKNRY
PVEYEQMVIR AGEDGTLLRL GDVARVEFGA YRYGVNTKAN GLPAVVLAIF QAPGSNANAV
ESSLQKVLQK VSPTFPKGIT YSIPYSSKKV VDESITQVLH TLIEAFLLVF LVVFIFLQDL
RSTLIPAIAV PVALVGTLFF MKLFGFSINV LTLFGLVLAI GIVVDDAIVV VEAVHAKMEQ
KGYGAKVATV SAMRDITKAI VTITLVMSSV FLPAGFLEGS TGVFYRQFAF TLAIAILLSA
LNALTLSPAL CALFLEKEHR HSSIGKRFFT SFNTAFEAMK RKYLGALLYL LRHKKVAYSG
LALLTALSFW MFKATPTAFI PDEDNGFVIV SATMPPGASF ARTKVVMDDA ARTLQAMPTV
KKVIEVAGIN ILTRTSSPSS GLLFVQLQDH NVRGKQGDIK NVIGAMSKKL ANREASFFVL
AQPTVPGFST VGGLELVLQD RNGVELSRFN EIAQNFLNGL RQHPAIGVAF TNFKANNPQY
ELEVDPMQAS QLGVSTRDVM SVLQAYYGSV QVSDFNRFGK YYRVIMQAEP SERTDESSIN
SMFIRNVRGD MVPLSSVVRL KRVYGVEAVD HFNLFNAISV NAVAKPGFST GQAIQAVDDV
ARKTLPTGFT YDWKGQSREE ISASGGLLLI FLLSIIFVYF LLAALYESYL LPLAVMLSIP
TGLLGVFIGI KLAGIANNIY VQVAIIMLIG LLAKNAILIV EFALQRRIAG RPLAVAAIEG
ARARLRPILM TSFAFMAGLL PLLFVSGPAA QGNHSIGAAA LGGMFAGLVF GILVVPLLFV
TFQYLQERIT GVAKPIVEAG DLLDAVVMKE KGTHV