Gene Cphamn1_1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1066 
Symbol 
ID6374740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1151299 
End bp1154406 
Gene Length3108 bp 
Protein Length1035 aa 
Translation table11 
GC content51% 
IMG OID642683568 
Productacriflavin resistance protein 
Protein accessionYP_001959486 
Protein GI189500016 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0284295 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000200954 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAATCAC TGTTCCGTTT TTTTGCTGAG CGTCACATGC TCGCGTACCT CATGACGATC 
CTGGTCTTTC TGTTCGGCAT GGCTACGCTT TGGCAGATCA ACCGGGCTCA GTATCCGAAG
GTGGACCTGG GTCAGATGGT GGTGACGACA AATTATCCCG GGGCTGCGCC GGAGGACGTC
GAACTCAATG TCACCAACAA GATCGAAGAT GAGCTGAAAA GTGTCACCGA TATCAAGCGG
GTCTTTTCGA TGTCGATGGA AAATGTCTCT ATCGTTATTG TCGATATAGA GCCGGATGCT
TCCGATGTTG ACGGGGTGAA ACGAGAGATT CGTGAAGCGG TTTTTCGAAT AACGGATTTT
CCAGAGGAGG TGACGGAATC TTCACTGATC ACGGATATCA AATCATCTAT TTTCCCTATT
CTTGAAGTTG GTCTGACGGG TGATAAGCCC TATCCGGAAT TGCGGGAGCT TGCCCGCAGG
TTCGAAAAGA AGCTCAAGGA CATTCCGGGC GTGGCGAGCG TGCAGCGTTA CGGTTACCGC
GACAGGGAGA TACAGGTCGA GCTGTATCCC GGGAAAATAA GAGAGCTTCA GGTTCCCATG
CAGGAAGTTG TGGATGCTAT TCAGCAGCGA AACATACGGG CAACCGGAGG TTCGCTTGAA
TCGTTTACCA GTGAGAAAAA CCTTGTTACC CTTGCCCAGT TCCGCGATCC CCGGGAAGTT
GGAGATGTTG TCGTGCGTTC AAGTTTCGAC GGACCGATTA TCAAGGTTGC CGATATAGCC
GATGTTACCG ACAGTTTCGA AGAAGAGCGG GTTTTGTCGA GAATCAACGG GCATCCGGCT
ATCTCGTTTC TGATCAACAA AAGTGAATCG GCTGATATTA TTCGTACGGT AGAGGCGATC
AGGGATCTGG TCAGTGAAGA GGAGGCTCTG CTGCCGGAGG GCGTAAGGTT TATTTATGGC
CTCGACTTTT CACAATATGT GGCCAACCAG CTTACCATTG TCATGACAAA CGGCGGAATC
GGTCTTGTGC TCGTCATGAT CGTTCTTGCC CTGTTTCTCA ATATCCGGAC CGCATTCTGG
GTGGCTCTCG GTATTCCTTT TACGCTGCTT GGCGGTATTT CCCTGCTTCC GCTCTTCGAT
GTCGAGCTCG ATACGGTGAC GCTTACCTCC CTGATCATCG TCATCGGTAT TGTTGTCGAT
GATGCCATTA TCATATCCGA AAACATTTTT CAGCGTCGCG AACGCGGCGA GTCTCCTATC
GAAGCGGTGG TCAACGGTAT CTATCAGGTG TACAAGCCTG TGCTGACGAC TGTTCTGACA
ACCTTTCTTG CGTTCGCTCC AATGTTTTTC ATGCCGGGCA TCCTCGGTAA ATTTGTTTTC
GTGATCCCGT TGACCATTTC GCTTGCTCTT TTTGTTTCAC TCATCGAAGC ATTCCTGATT
CTCCCGGCAC ATGTGATGCC CGGACTGTAT ACGAGAGAGG GAGAAGATCC CAAATCCACC
ATGCGCAACT GGTTTGTACC GATACGTGAC ATGTTTGAAA AGGTTCTTCA TTCAATGCTG
CGTTTCCGGT ACGTTTTGGT ATTTTTTGCA CTGCTCTCGT TTGGCGGGGC TCTGTTTTAC
GGCCTGAATT ATATCAGCTA TATTCTTTTT CCGACAAAAG GATCCGATGC GTTCAATATC
TGGATCGAGG TTCCTGTTGG AAGCTCTCTC AAGACAACCT CTGATAAAGC CGCCGAGTTT
GAAAAATTAA TAGGGGTTTT GCCTGAAGAT GAGCTGGATG CCTATCTGAC AAGAATCGGA
ACACAGGCGG ACATTGTCCC GATCGAGCAG GAAAATTTTG CCGAGCTTGC CGTCAAGCTT
ACACCATACG GGACACGGGA AAGATCAGCG GATGAAATTG TGGACGATAT CCGTGAAAAA
GCTAAAGAGA TCACCGGTAT CTCGCAGACA ACTTTTTTCG TTGAATCCGG AGGTCCCCCT
GTCGGTAAGC CTGTCACGAT ACGTGTTGTC GGTTCCGACG ATTCGCTGAG GACTGCTCTT
GCCGATTCAG TCTTCAACTA TCTCGGCGCA ATTCAAGGTG TTACTGACCC GGATCGCAAT
GACAAGGATG GTAAGGAGCA GATTGAAATA GCAATACGCC ATGACCGCCT TGCCCGGCTC
GGATTGTCGG TGGCGGATAT TGCCAGAACG GTCAGAACGG CCTATGACGG TCAGGTGGTC
ACCAGCGTTC GTTACGGTGA AGAGGAGGTT GATTTCAGGG TGATGCTCAA GAAGTTTGCC
AGAAAGAAGC TCGAATACCT TCAGGATCTT TCTATTCCCA ACAGGACGGG CAGGTTGATT
CCGTTGAAAG AGGTCGCCAG TTTCGAGCAG GGTTCGGGCC CATCCATATT TCATCACTAT
GATGGAGAGC GGTCAGTGAC GATTTCGGCA GATGTTAAAC AGGACACGGT CACGCCAATC
GAGGTGATGA GGAATGTGGA AACGCATTTT ACGGGAACCC GCGATTTCCC AGGAATCAAA
CTGGTGTTCG GTGGAGAAGC CCAGGAGTCG GAAGAATCGC TCCGGGGATT GTTCATCGCT
TTCGGTGTGG CTGCTTTCGG GATTTACTTC CTGCTCATTC TCCTGTTCAA CTCCGTTACA
CAGCCGCTGC TTGTGATGAT GTCGATACCG TTTGCCATCA TCGGTATCGT GATCACCTTT
GCGCTGCACG GTGAGGTATT CAGCTTCCTC GGGTTGCTCG GTGTTGTCGG TATGGCCGGT
GTCGTGGTCA ACGACTCGCT TGTCCTGGTA AACTACCTCA ACGAGCTTTA CCAGTCCGGG
CAAACAAGGG ATATTCCTGC CCTGGTTGCA AAAGGCACCG CTGATCGTCT GCGTGCCATT
CTGCTGACAA CGGTGACAAC TGCCGCCGGT CTTCTGCCGC TAGCGTACGG TATCGGCGGA
ACGGATGCAT CGATGATGCC GATGGCACTC GCGCTGGGCT GGGGGCTGCT GCTCGCTACC
CCTTTGACGT TAGTGCTGAT TCCCTGTCTG TACATGATCG GTTTTGATAT AAGAGATCTG
TTGTCAAGGG TTTCCGGTGG AGGTAAAGGG CTCCCTGAAG ATGCGTGA
 
Protein sequence
MKSLFRFFAE RHMLAYLMTI LVFLFGMATL WQINRAQYPK VDLGQMVVTT NYPGAAPEDV 
ELNVTNKIED ELKSVTDIKR VFSMSMENVS IVIVDIEPDA SDVDGVKREI REAVFRITDF
PEEVTESSLI TDIKSSIFPI LEVGLTGDKP YPELRELARR FEKKLKDIPG VASVQRYGYR
DREIQVELYP GKIRELQVPM QEVVDAIQQR NIRATGGSLE SFTSEKNLVT LAQFRDPREV
GDVVVRSSFD GPIIKVADIA DVTDSFEEER VLSRINGHPA ISFLINKSES ADIIRTVEAI
RDLVSEEEAL LPEGVRFIYG LDFSQYVANQ LTIVMTNGGI GLVLVMIVLA LFLNIRTAFW
VALGIPFTLL GGISLLPLFD VELDTVTLTS LIIVIGIVVD DAIIISENIF QRRERGESPI
EAVVNGIYQV YKPVLTTVLT TFLAFAPMFF MPGILGKFVF VIPLTISLAL FVSLIEAFLI
LPAHVMPGLY TREGEDPKST MRNWFVPIRD MFEKVLHSML RFRYVLVFFA LLSFGGALFY
GLNYISYILF PTKGSDAFNI WIEVPVGSSL KTTSDKAAEF EKLIGVLPED ELDAYLTRIG
TQADIVPIEQ ENFAELAVKL TPYGTRERSA DEIVDDIREK AKEITGISQT TFFVESGGPP
VGKPVTIRVV GSDDSLRTAL ADSVFNYLGA IQGVTDPDRN DKDGKEQIEI AIRHDRLARL
GLSVADIART VRTAYDGQVV TSVRYGEEEV DFRVMLKKFA RKKLEYLQDL SIPNRTGRLI
PLKEVASFEQ GSGPSIFHHY DGERSVTISA DVKQDTVTPI EVMRNVETHF TGTRDFPGIK
LVFGGEAQES EESLRGLFIA FGVAAFGIYF LLILLFNSVT QPLLVMMSIP FAIIGIVITF
ALHGEVFSFL GLLGVVGMAG VVVNDSLVLV NYLNELYQSG QTRDIPALVA KGTADRLRAI
LLTTVTTAAG LLPLAYGIGG TDASMMPMAL ALGWGLLLAT PLTLVLIPCL YMIGFDIRDL
LSRVSGGGKG LPEDA