Gene Acid345_4070 details

Gene Information

Locus tag: Acid345_4070
Symbol:
ID: 4072492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4816746
End bp: 4817963
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 60%
IMG OID: 637986101
Product: secretion protein HlyD
Protein accession: YP_593144
Protein GI: 94971096
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0845] Membrane-fusion protein
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
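
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4816746
end_bp = 4817963

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1218
print(protein_length)  # 405
```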


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.409958
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.154431
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACTCG AACCTGAGCG TTCCATTCGT CTCGCTCGCG AAATTCAACC ATTCAGAAAA 
GCCGCACTCA CGCGGCTGGG CTTCGGTGCG GCAATCCCGA CTCTATTGCT GGTGCTCACT
CTCGGGTGCG ACAAGAAAGA GGAGGCAGTT GCTCCGCCTC CACCCGACGT GCAAGTGACC
GGCGTCGTGC AGCAGGACGT GCCCCTGTAT GGCGAGTGGG TCGCCACCCT CGATGGTTTT
GTGAACGCAC AGATCGCGCC ACAAGTCAGC GGCTACCTGA TGAAACAGAA CTACCGAGAG
GGTTCGGTGG TGAAAAAAGG CGACGTGCTC TTCGAGATCG ACCCACGGCC CTTTGAAGCC
GCGCTCGACC AGGCAAAGGG CAATTTCGCC GAAACCCAAG CCAAGCTCGG CAAGACTGAG
CTCGACGTAA AGCGCGACAC GCCACTGGCG GCGCAGAGCG CGATTCCGCA AGCGCAACTC
GACAACGACA TTCAAGCGAA CGAAGCCGCG AAAGCCATGA TCGTGGCGTC GCAAGCACAA
GTGCAGCAAG CGGAGTTGAA CGTCGGCTTC ACCAAAGTCC GCTCGCTGGT GGACGGAATC
GCCGGACTCG CAAAAGGACA GATTGGCGAC CTCGTCGGCC CGACAACGAT CCTCACCACC
GTTTCGCAAG TTTCGCCGAT CAAGGCTTAC GTCTCGATCA GTGAGCAGGA ATACCTTCGC
GCCGCGCAAA GAATCAGCAT GGTGTCTTCC GGACAACTCA GCCTCGACAA GATGCCGAGG
AACCTCGAAC TGATCCTCTC CGATGGCACG ACCTACAAAT ACAAGGGCTA CTTCGTGGTT
GCCGATCGCC AGGTTGATCT CAAGACCGGC ACGATTCGTC TCGCCGCTGC TTTCGACAAT
CCAGAAGGCA TCCTTCGGCC AGGACAATTC GCGCGCCTAC GCGTTGAAAC CCGTGTTGCG
AAGGACGCGC TCCTCGTCCC GCAGCGCGCG GTCGTCGAGA CCCAGGGCTC GTACAGCGTC
GTCGTTGTCG GTTCCGACAG CAAGGCAAGC ATTCGCCCGG TAAAGACCGG CGAGCGCGTC
GGCGAGTTGT GGATCATCAC CGAGGGCCTC AAGCCAGGCG AACAGGTCAT CGTCGAAGGC
ATGCAGAAAG CGAAGGAAGG CAGCCCGGTC AAAGCGGTAC AGGCGCAAGC GGAACCCACC
AAGGCCCAAG GAGACTAA
 
Protein sequence
MSLEPERSIR LAREIQPFRK AALTRLGFGA AIPTLLLVLT LGCDKKEEAV APPPPDVQVT 
GVVQQDVPLY GEWVATLDGF VNAQIAPQVS GYLMKQNYRE GSVVKKGDVL FEIDPRPFEA
ALDQAKGNFA ETQAKLGKTE LDVKRDTPLA AQSAIPQAQL DNDIQANEAA KAMIVASQAQ
VQQAELNVGF TKVRSLVDGI AGLAKGQIGD LVGPTTILTT VSQVSPIKAY VSISEQEYLR
AAQRISMVSS GQLSLDKMPR NLELILSDGT TYKYKGYFVV ADRQVDLKTG TIRLAAAFDN
PEGILRPGQF ARLRVETRVA KDALLVPQRA VVETQGSYSV VVVGSDSKAS IRPVKTGERV
GELWIITEGL KPGEQVIVEG MQKAKEGSPV KAVQAQAEPT KAQGD
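
The protein sequence can be related back to the gene sequence by translating codons. The following is a minimal sketch, not the pipeline that produced this record: it builds the codon table by hand, relying on the fact that translation table 11 uses the same amino acid assignments as the standard code (it differs in the permitted start codons, which this sketch does not handle; the gene here starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical. Only the first and last blocks of the gene sequence are used as input.

```python
# Codon table in TCAG order; amino acid assignments are those of the
# standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate DNA (whitespace allowed) in frame 0 until a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the coding sequence
            break
        protein.append(residue)
    return "".join(protein)

# First 30 bases and the final line of the gene sequence above.
first_block = "ATGTCACTCG AACCTGAGCG TTCCATTCGT"
last_block = "AAGGCCCAAG GAGACTAA"

print(translate(first_block))  # MSLEPERSIR
print(translate(last_block))   # KAQGD
```

Both results agree with the start and end of the protein sequence listed above. The final line can be translated on its own because every preceding line holds 60 bases, a multiple of three, so the reading frame is preserved.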