Gene Acid345_1469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1469 
Symbol 
ID4069619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1777433 
End bp1778485 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content57% 
IMG OID637983478 
Producttype IV pilus assembly protein PilM 
Protein accessionYP_590545 
Protein GI94968497 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4972] Tfp pilus assembly protein, ATPase PilM 
TIGRFAM ID[TIGR01175] type IV pilus assembly protein PilM 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.11937 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.330674 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGGAC TGGGAACAGC AAAGAGCATA GTTGGCCTGG ATATCGGATC CAGCAGCATC 
AAGGCCGTGG AGTTAAAGAA GTCGCGCAAT GGCGTAGAAG TGGCGCACAT GGCCATGGAG
CCCCTGTCGT CTGACATCGT CGTGGACTCG ATGATTGTGG ACAGCGGCAG CGTCGCCAGC
GCAATTACCA AGATCTTTAC GGAGTCGGGC ATCAAGACTC GTGCGGTAGC GACCTCGGTC
AGCGGACACT CCGTGATCGT GAAGCGCATC CCGATGTCGA CGATGAGCGA CTCTGAACTT
TCCGGCATCA TCCAGACCGA AGCCGCGCAA CACATCCCGT TCGATATCTC GGACGTCAGC
ATTGACTACC AGATCCTTTC CGACACCGGT GGCTCGACGA TGGACGTCCT GCTGGTCGCG
GTGAAGAAAG ACAAAATTCT TAACTACACG AACGTTCTGT CGCTCGCCGG CAAGTCTCCG
GCGGTGGTGG ACATCGACGC GTTCGCCCTC CAGAACTGCT ACGAATACAA CTATCAACCC
GGTCCGGGCG CGACAGTTGC GTTGTTGAAT CTCGGCGCCA GCGTAATGAA CATCAACATC
GTGAAGGGCA CCACACCCCT GTTCACGCGC GATGTGAGCG TCGGCGGCCA CCAATACACC
GATTCGTTGC AGAAGGAACT GGATCTCAGC TTTGAAGACG CGGAAGCGCT GAAGCTCGGT
AAGAAAGTGG GCACAGTCAG CGAAGACGCG AAGATGCCGA TCCTCCAGCA AGTGACCGAA
ATCATCGTGC TGGAAATTCA GAAGACTTTC GACTTCTTCC GCGCTACCGC GACGGGAGAG
CACATTGAGC GCATTTACCT CGCGGGCGGT TCGTCGCAGG TGCCGGGCCT GATTGAAGGC
CTGCGCCAGG AGTTCTCGCT CCCAGTCGAG ATCCTCAATC CATTCCAGCG CATTGAACCG
CCTCTTGGCA CGGGCGCGGA TCTCGCCGAC AAGAACGCCG GCCAGATGGC AGTTGCCGTG
GGACTCGCCC TTAGGAGTTT TGACGAATTA TGA
 
Protein sequence
MFGLGTAKSI VGLDIGSSSI KAVELKKSRN GVEVAHMAME PLSSDIVVDS MIVDSGSVAS 
AITKIFTESG IKTRAVATSV SGHSVIVKRI PMSTMSDSEL SGIIQTEAAQ HIPFDISDVS
IDYQILSDTG GSTMDVLLVA VKKDKILNYT NVLSLAGKSP AVVDIDAFAL QNCYEYNYQP
GPGATVALLN LGASVMNINI VKGTTPLFTR DVSVGGHQYT DSLQKELDLS FEDAEALKLG
KKVGTVSEDA KMPILQQVTE IIVLEIQKTF DFFRATATGE HIERIYLAGG SSQVPGLIEG
LRQEFSLPVE ILNPFQRIEP PLGTGADLAD KNAGQMAVAV GLALRSFDEL