Gene Cpha266_1747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1747 
Symbol 
ID4571109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1973891 
End bp1975072 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content53% 
IMG OID639766330 
Productsecretion protein HlyD family protein 
Protein accessionYP_912188 
Protein GI119357544 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID[TIGR01843] type I secretion membrane fusion protein, HlyD family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.570517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAGTC AAAAAGAAGA AGGAGAAAAA CAGCCTGTTG CGACTGAAAA GGCTGCCGAA 
GTTCACGTTA AAAGCTATCG GGATACCGGC AGTACGATCC GTCTCGGAAT ATGGATACTT
CTTGTCGGCT TTGGCGGTTT TCTGTTGTGG GCGGCATTTG CACCGCTTGA CGAAGGAGTT
CCCTGCCAGG GCGTTGTCAG TATTGCTACC AAGCGAAAGG TCGTTGAGCA TCTTCGAGGT
GGTACCGTTG AGAAGGTTGA GGTGCGGGAA GGTCAGATCG TACAGGAGGG AGAGGTGCTG
ATGAGACTTG ACAGCCAGAC GGCAAGGGCG CGGTACGATG AGATTCATCA GCACTATATC
GGTACAAGAG CTACGGCAGA CAGGCTGCTT GCGGAGATGA GCGGAGCAGG ATCAATTGCT
TTTCACCATG ATCTTCTTGC CGATCCGGAT CGAACTCTTG CCGAACAGAA TATGCGGACA
CAGAGGCAGC TTTTTCTCTC CCGCCAGGTT ACGCTAAGGA TTCTCAATGA ACAGCTCTCT
GGTATTGTCT CGCTTGTCAA GGAAGGTTAT GCCCCCTTGA GTCAGCAGCA CGAGCTGGAG
CTGAAGATTG CAGAGCTGAA GAGTGCAACA GCCTCGCAAC TCGCACAGGT GCAGCTTGAA
GTAGAAGCCG ACGCGGAAAA GACTCGAGCC CTTGCTGAAG AGCTTGCCGA TACTGAACTC
CGATCTCCCG CCTCAGGACA GGTGGTGGGC CTGCAAGTAC AGACCGTTGG TGCCGTGATT
CAGCCAGGTC AGAAGGTTAT GGATATCGTT CCGCTTCATG AAGGCCTGCT CATAGATGCC
AAGGTTGCCC CGCATCTGAT CGACAGCATC CGTAAGGGAC TGCCGGTGGA CGTGAGCTTT
TCCTCATTTG CACATGCGCC CCAGCTTGTT GTGCAGGCTG TGGTTGCTTC GATCTCAAAA
GATATTATTA CGGACCCGCA GACAAACCCG TCGATGCCTG GAGCCTCCTA TTACCTTGCC
CGGATTGCGG TGACCCCGCA TGGACTCAAC TCTCTCGGAA ATCGTCAGAT GCAGCCGGGA
ATGCCGGTAC AGGTGGTTAT TAAAACCGGG GAACGTTCCC TGCTGACCTA CCTGATTGAT
CCGCTGCTCA AGCGGATTAC AGTCTCCATG AAGGAGGAGT GA
 
Protein sequence
MHSQKEEGEK QPVATEKAAE VHVKSYRDTG STIRLGIWIL LVGFGGFLLW AAFAPLDEGV 
PCQGVVSIAT KRKVVEHLRG GTVEKVEVRE GQIVQEGEVL MRLDSQTARA RYDEIHQHYI
GTRATADRLL AEMSGAGSIA FHHDLLADPD RTLAEQNMRT QRQLFLSRQV TLRILNEQLS
GIVSLVKEGY APLSQQHELE LKIAELKSAT ASQLAQVQLE VEADAEKTRA LAEELADTEL
RSPASGQVVG LQVQTVGAVI QPGQKVMDIV PLHEGLLIDA KVAPHLIDSI RKGLPVDVSF
SSFAHAPQLV VQAVVASISK DIITDPQTNP SMPGASYYLA RIAVTPHGLN SLGNRQMQPG
MPVQVVIKTG ERSLLTYLID PLLKRITVSM KEE