Gene Acid345_2552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2552 
Symbol 
ID4072196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3010272 
End bp3011627 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content59% 
IMG OID637984569 
Productsecretion protein HlyD 
Protein accessionYP_591627 
Protein GI94969579 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.603084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAATC AACGACTGGA AATCGAAGAG AAACATACCG ACGCCGAAAC CACGCACAAC 
ATTCGGCTCT GGATCATCGT TGCAGCAATT CTGCTCGTGA TCGTTTTTCT TGTCGGCTTT
GTGCCTCGAC ACGAGCGCAC CAAGCGGATC GGCGAGGACG CGAAGGAACG CCAGGGCCAG
CCTCCGACCG TGGATGTCAC CAAGGTACGG AGATCGGACG CCAAATCGCA CCTCAGTATT
CCCGGGACGA TCACGGCGGT AGTCGAAGCA CCCATCTACG CGCGGGCTTC CGGGTACATC
TCGAAGCGCA ATGTAGATTT TGGCGATCAC GTCCACGCAG GTGCATTGCT CGCAACTATT
GATGCGCCCG ACCTCGACCA GCAGGTGGAC CAGGCCCGCG CCACGTTGCT CCAAAGCGAA
TCAGTTCTTG GTCAACAACA AGCGCAGCTC AAACTTGCGA GCGTTACCTG GGACCGTTAC
AAAGTGCTGG TCCAACGCGG TGTCGCTTCC AAGCAAGAGG GCGACACGCA GGAAGCGACC
TACGAAGTTG CTATGGCAAA CGTGAAGGCT GCGGAAAATA GTGTCACGGC CAGTCGCGCC
AGCCTCGATC GTTTGCTGAA ACTGCAGAGC TACGAAAAAG TCACCGCGCC CTTTGAGGGT
ATCGTCACCG CACGCAACGT GGATGTGGGG ACCCTCATCT CCACGACAGG CGCGGGACAG
GGGAATGCGT CTGGCGCGGC GACCGGGCTT GCGCAGGGCG GAGAGATGTT CCGCGTTGCC
CAAATCAACC GTCTTCGCGT CTTCGTGAGT ATTCCTGAGT CTTACGCAGC CTTTGTACAG
ACCGGCCAGA ACGCAGACGT CACGGTGACC TCCGTGCCGA ACCAGAAGTT CGCCGGAAAG
GTTACGCGCA CCACGAACGC GGTTGATCCA GCCACTCGTA CGCTGCTCAC GGAAGTGCAA
ATTGATAACC GAGAAGGCAA GCTGCTGCCG GGCATGTACG GCACGATTAC TTTCGAGAGT
GTCCGCACCA TGCCGCCGCT CGTGATTCCG TCGGACGCGC TGATTTACCG ATCGCAAGGC
ACGATGGTCG CCACGGTACA GGACAATATT GTTCACCTCG TGCCGATCAA GGTAGGACGC
GACTTCGGCT CCCAGCTTGA AATCGTCGAA GGGCTGAACG AAGGCGATTT TGTCGCCATC
AATCCGAGCG ATGTCGCTCG CGACGGCGCC AAAGTGACGC CGCATGAACT CGCATCTCAG
AACAACGCCC GGCCGGGCGC TGCGCAACCA CCCAGTGGTC AGCAAAACAA CGGTCAAGGC
CAACAAGGCG CAGGGAAGAA GAACTCCGGC CAATGA
 
Protein sequence
MSNQRLEIEE KHTDAETTHN IRLWIIVAAI LLVIVFLVGF VPRHERTKRI GEDAKERQGQ 
PPTVDVTKVR RSDAKSHLSI PGTITAVVEA PIYARASGYI SKRNVDFGDH VHAGALLATI
DAPDLDQQVD QARATLLQSE SVLGQQQAQL KLASVTWDRY KVLVQRGVAS KQEGDTQEAT
YEVAMANVKA AENSVTASRA SLDRLLKLQS YEKVTAPFEG IVTARNVDVG TLISTTGAGQ
GNASGAATGL AQGGEMFRVA QINRLRVFVS IPESYAAFVQ TGQNADVTVT SVPNQKFAGK
VTRTTNAVDP ATRTLLTEVQ IDNREGKLLP GMYGTITFES VRTMPPLVIP SDALIYRSQG
TMVATVQDNI VHLVPIKVGR DFGSQLEIVE GLNEGDFVAI NPSDVARDGA KVTPHELASQ
NNARPGAAQP PSGQQNNGQG QQGAGKKNSG Q