Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1203 |
Symbol | |
ID | 3748237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 1597682 |
End bp | 1600849 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637773737 |
Product | hydrophobe/amphiphile efflux-1 HAE1 |
Protein accession | YP_379508 |
Protein GI | 78189170 |
COG category | [V] Defense mechanisms |
COG ID | [COG0841] Cation/multidrug efflux pump |
TIGRFAM ID | [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0458875 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGAGC GTTTTATTTC CCGCCCTGTA CTTGCCACCG TTATCTCCAT TTTGTTAGTG ATTCTTGGTG TGGTAGGTCT TTCGCAATTG CCCGTTACCC GCTTCCCCGA TATTGCGCCG CCAAGCGTTT CAGTGTCGGC AACCTACCCG GGGGCAAGTG CTGAAACGGT GGCTCGTAGT GTGGCTCCAC CGCTTGAGGA GGTTATTAAC GGTGTTGAAA ACATGACCTA CATGACTTCA ACCTCTTCCA ATGACGGATC GCTAAACATT TCCGTTTTTT TTAAGCAAGG CACTAACCCC GATCAAGCTG CGGTTAATGT GCAAAATCGA GTGTCGCAAG CTACAAGCCG TTTGCCAGCA GAAGTAAATC AAATTGGTGT TAGTACGGTG AAGCGGCAGA ATAGCCAGAT TATGCTCATC AACTTAGCCA GCACCAATCC TGAGTATGAT GTGGTCTTTT TGCAGAATTA TGCAAAAATC AATCTTGTTG ACGATTTATC GCGTGTTCCG GGTGTTGGGC AGGTTTCGGT ATATGGCAAC CTCGATTACT CAATGCGCAT TTGGCTAAAG CCGCAAGTAA TGGCAGCTTA TAGCATAACG CCACAAGAGG TGCTATCGGC AATTCAAAGC CAAAACTTTG AAGCGGCGCC CGGCTCATTT GGTGAAAACA GCAATGAAGC CATGCAGTAC GTGATGCGCT ACAAAGGCAA GAATCGCTAT CCGGTGGAAT ATGAGCAAAT GGTGATTCGT GCGGGAGAGG ATGGCACTTT ACTGCGTTTG GGTGATGTTG CACGAGTGGA GTTTGGGGCG TACCGTTACG GCGTAAACAC CAAAGCAAAT GGTTTACCAG CAGTTGTGTT GGCTATTTTT CAAGCGCCCG GTTCCAACGC TAATGCCGTT GAAAGTTCCC TCCAAAAAGT GCTGCAAAAA GTATCTCCCA CCTTTCCCAA AGGCATTACT TACAGTATTC CATATAGTTC AAAAAAAGTT GTTGACGAGT CCATCACTCA AGTACTTCAT ACCTTAATAG AAGCCTTTTT GCTGGTCTTT CTCGTGGTCT TCATTTTTTT GCAAGATTTA CGCTCCACCT TAATTCCTGC AATTGCGGTG CCTGTGGCGC TTGTGGGCAC CCTCTTTTTT ATGAAGCTCT TTGGATTTTC CATTAACGTG CTGACGCTTT TTGGCTTAGT GCTCGCCATT GGCATTGTGG TGGACGATGC CATTGTGGTG GTTGAAGCGG TTCACGCTAA AATGGAGCAA AAGGGCTATG GCGCTAAAGT GGCAACGGTT TCGGCAATGC GCGATATCAC TAAAGCTATT GTAACCATCA CGTTGGTGAT GTCCTCCGTT TTTTTGCCTG CGGGATTTTT AGAGGGTTCA ACAGGCGTGT TTTATCGCCA ATTTGCTTTT ACGCTTGCTA TTGCCATTCT GCTTTCAGCG CTGAATGCCT TAACGCTTAG CCCAGCACTT TGCGCCCTTT TTCTTGAAAA AGAGCATCGC CACTCATCCA TTGGCAAGCG CTTTTTTACC TCCTTTAACA CGGCGTTTGA GGCAATGAAG CGCAAGTACC TTGGCGCCTT GCTCTACCTT CTTCGCCATA AAAAAGTTGC ATACAGTGGT TTGGCGCTGC TTACAGCACT TTCGTTTTGG ATGTTTAAAG CTACTCCCAC CGCATTTATT CCCGATGAGG ATAATGGCTT TGTAATTGTG TCGGCAACCA TGCCTCCGGG CGCTTCGTTT GCGCGTACCA AAGTGGTGAT GGATGACGCT GCTCGCACAT TGCAAGCTAT GCCAACGGTA AAAAAGGTGA TTGAGGTTGC AGGCATTAAC ATTTTAACCC GTACTTCGTC GCCCTCTTCA GGCTTGCTCT TTGTGCAGTT GCAAGATCAC AACGTGCGTG GCAAGCAGGG TGATATTAAG AATGTGATTG GGGCGATGTC GAAAAAATTA GCCAATCGCG AAGCCTCATT TTTTGTGTTG GCACAACCAA CGGTGCCGGG TTTTAGCACC GTTGGTGGAC TTGAATTAGT GCTGCAAGAT CGCAACGGAG TGGAACTTAG CCGCTTTAAT GAGATTGCTC AAAACTTTCT GAATGGCTTG CGGCAGCATC CCGCTATTGG GGTGGCATTT ACCAACTTTA AGGCAAATAA TCCTCAATAT GAGCTTGAAG TTGACCCCAT GCAGGCAAGC CAGCTTGGCG TAAGTACGCG CGATGTTATG AGCGTGCTGC AAGCTTACTA CGGTAGTGTG CAAGTCTCCG ATTTTAACCG CTTTGGCAAA TATTATCGTG TTATTATGCA AGCGGAACCA AGCGAACGCA CCGATGAATC CTCAATCAAT AGCATGTTTA TTCGCAATGT GCGTGGCGAT ATGGTGCCAC TCTCTTCGGT TGTGCGCTTA AAGCGAGTTT ATGGCGTTGA AGCCGTTGAC CACTTCAACC TCTTTAATGC TATTTCGGTT AATGCCGTTG CCAAACCCGG CTTTAGCACA GGGCAGGCAA TTCAAGCTGT TGACGATGTG GCACGTAAAA CGCTGCCCAC AGGTTTTACC TATGATTGGA AAGGGCAAAG CCGCGAAGAA ATTTCAGCCA GCGGAGGTTT ACTCCTCATT TTCTTGCTCT CCATTATCTT TGTCTATTTT TTGCTTGCGG CATTGTACGA AAGCTATCTT TTACCACTTG CGGTTATGCT CTCCATTCCA ACAGGATTGT TGGGAGTTTT TATTGGCATC AAATTAGCAG GCATTGCAAA CAACATTTAT GTGCAAGTTG CCATTATCAT GCTTATTGGT TTGCTGGCAA AAAATGCCAT TCTTATTGTG GAATTTGCCC TTCAGCGCCG CATTGCAGGG CGCCCACTTG CGGTTGCAGC CATTGAGGGC GCTCGTGCTC GCCTTCGCCC CATTCTTATG ACCTCGTTTG CTTTTATGGC TGGCTTGCTG CCACTGCTTT TTGTGTCGGG TCCAGCCGCT CAAGGTAACC ACTCCATTGG CGCTGCGGCG CTTGGCGGTA TGTTTGCGGG TTTAGTGTTT GGTATTCTTG TGGTGCCGCT GCTTTTTGTA ACGTTCCAAT ATCTGCAAGA GCGTATTACC GGTGTTGCAA AGCCAATTGT TGAAGCGGGG GATTTGTTGG ATGCTGTGGT GATGAAAGAG AAAGGAACTC ATGTGTAA
|
Protein sequence | MFERFISRPV LATVISILLV ILGVVGLSQL PVTRFPDIAP PSVSVSATYP GASAETVARS VAPPLEEVIN GVENMTYMTS TSSNDGSLNI SVFFKQGTNP DQAAVNVQNR VSQATSRLPA EVNQIGVSTV KRQNSQIMLI NLASTNPEYD VVFLQNYAKI NLVDDLSRVP GVGQVSVYGN LDYSMRIWLK PQVMAAYSIT PQEVLSAIQS QNFEAAPGSF GENSNEAMQY VMRYKGKNRY PVEYEQMVIR AGEDGTLLRL GDVARVEFGA YRYGVNTKAN GLPAVVLAIF QAPGSNANAV ESSLQKVLQK VSPTFPKGIT YSIPYSSKKV VDESITQVLH TLIEAFLLVF LVVFIFLQDL RSTLIPAIAV PVALVGTLFF MKLFGFSINV LTLFGLVLAI GIVVDDAIVV VEAVHAKMEQ KGYGAKVATV SAMRDITKAI VTITLVMSSV FLPAGFLEGS TGVFYRQFAF TLAIAILLSA LNALTLSPAL CALFLEKEHR HSSIGKRFFT SFNTAFEAMK RKYLGALLYL LRHKKVAYSG LALLTALSFW MFKATPTAFI PDEDNGFVIV SATMPPGASF ARTKVVMDDA ARTLQAMPTV KKVIEVAGIN ILTRTSSPSS GLLFVQLQDH NVRGKQGDIK NVIGAMSKKL ANREASFFVL AQPTVPGFST VGGLELVLQD RNGVELSRFN EIAQNFLNGL RQHPAIGVAF TNFKANNPQY ELEVDPMQAS QLGVSTRDVM SVLQAYYGSV QVSDFNRFGK YYRVIMQAEP SERTDESSIN SMFIRNVRGD MVPLSSVVRL KRVYGVEAVD HFNLFNAISV NAVAKPGFST GQAIQAVDDV ARKTLPTGFT YDWKGQSREE ISASGGLLLI FLLSIIFVYF LLAALYESYL LPLAVMLSIP TGLLGVFIGI KLAGIANNIY VQVAIIMLIG LLAKNAILIV EFALQRRIAG RPLAVAAIEG ARARLRPILM TSFAFMAGLL PLLFVSGPAA QGNHSIGAAA LGGMFAGLVF GILVVPLLFV TFQYLQERIT GVAKPIVEAG DLLDAVVMKE KGTHV
|
| |