Gene Acid345_2002 details


Gene Information

Locus tag: Acid345_2002
Symbol:
ID: 4070908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2398731
End bp: 2399924
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 56%
IMG OID: 637984016
Product: hypothetical protein
Protein accession: YP_591077
Protein GI: 94969029
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4966] Tfp pilus assembly protein PilW
TIGRFAM ID: [TIGR02532] prepilin-type N-terminal cleavage/methylation domain
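
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2398731, 2399924)
print(length_bp)                  # 1194
print(protein_length(length_bp))  # 397
```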


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00963424
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATGC GCGGCGCAAA AGGTTTCACG TTGGTCGAAC TGCTGGTGGC AATCAGCCTT 
GGATTGCTGG TTACCGGTGC AGCAGTCGCT GTCTACAAAC AGGCAGTAGA CAGTACGACC
TACCTCACGC AGCGAACTGT CGTCCAAGGC AATGCTCGGG CTGCGATGAA TACGATCTCG
CAGGACCTGA ACTTTGCCGG GTATGGGCTA CCAATCGGCG GAATTCCAGT GCCGACGGCC
GCGCTCTTTA GCTGCGCTAC TGGAAGCGCC GCCGCGTCCT GGGCATATAG CTGCCCGACG
ACTGCGCCAT CGTTCCCGGT TATCTCGGGC GCCGCGACCA TGTCCGGCAT CACGCCGATG
TATCAGGCCG GCCCGACGAT CAACGGAAAC GCAACTGACC AGATGGCAAT GGCTTATGTT
GACAGCTCTC CAAATTTTTC CAACAACACT TGCGGGGTTT CTACGCAATG TGGGTTTGAC
GCATTCCCAT TGACGCAGGC GTCGGTGTCT GGCAGTACAA CGACGCTTTA CTTCAACGGT
TCGACTACTC CGGCGCCAAA CGATACCAAG TGGGGTTTGA AGGTTGGGGA CATCCTGCTG
GTCTCGAACT CCACCGGCCA GGCCGTAGGA GAAGTCACGA GCGTAACCTC GGGCAATGTT
GTGCTCGCTG CAAGTGATCC GATGAAGTTG AATCAGGCGT TCGGGACAGG CGGATCGGTG
CCCAATGTCC TCGGATTCAG TTCAGGTATC CAAGCTTACA ACAACGGAGC AGGCCCCCTG
CAGTCAACAA ACGTGAAGCG TCTGTACATC GTGACTTACT ACGTAGCCAC AGATCCTTTG
GCGCAGGCGG TGGGAACAAC TGGAAATCCG ACGCGCCTGT ACCGGATGGT GAACGGCGAT
TCCAATACCA ATCCCCCGGT TCCAGTGGCA GAACAGATTT CCAATCTGAC CTTCAGCTAC
AACATGTTCG ATTCTGTCTG TGGCGGTTCA CAGTCCGCCA ACCAACGCAA TCCGACAACG
AACCAAATCG GCTTGATCAA GACGATCAAT GCCAGCATTT TTGCGGCGAG CACACTCAAC
ACGACAGCTA TTCCCGGCCA GGCCATCCAG CAGATTCCGA TGACCACCAC AGTTTCGCCA
AGGAACCTCA GTTATTTCGA CTCGTATTCT TCGACGCCGC AAGGCAGTTG CTAA
 
Protein sequence
MKMRGAKGFT LVELLVAISL GLLVTGAAVA VYKQAVDSTT YLTQRTVVQG NARAAMNTIS 
QDLNFAGYGL PIGGIPVPTA ALFSCATGSA AASWAYSCPT TAPSFPVISG AATMSGITPM
YQAGPTINGN ATDQMAMAYV DSSPNFSNNT CGVSTQCGFD AFPLTQASVS GSTTTLYFNG
STTPAPNDTK WGLKVGDILL VSNSTGQAVG EVTSVTSGNV VLAASDPMKL NQAFGTGGSV
PNVLGFSSGI QAYNNGAGPL QSTNVKRLYI VTYYVATDPL AQAVGTTGNP TRLYRMVNGD
SNTNPPVPVA EQISNLTFSY NMFDSVCGGS QSANQRNPTT NQIGLIKTIN ASIFAASTLN
TTAIPGQAIQ QIPMTTTVSP RNLSYFDSYS STPQGSC