Gene Acid345_2522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2522 
Symbol 
ID4069891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2977784 
End bp2979643 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content59% 
IMG OID637984539 
ProductTPR repeat-containing protein 
Protein accessionYP_591597 
Protein GI94969549 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1450] Type II secretory pathway, component PulD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.417958 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGACAG CTCTTTGGGG TGCGATTCTA CTGGCTGCCG CGAGCGTCGC GGCACAGGAC 
CTGCCGTCTC CGAATACACC GGCGGTTGAG GCCAAGGTGC CGTCGCAGAA AGCGGAAAAG
CCCTCGAAGA GCGAAGAAAA ACAGGCGAAA CACGAGTTCT CAGAAGGGAT GAAGCGGCAA
AAGGCCGGAG ACTTGCAGCA GGCCTACGAA CATTTCGAAG CCGCTTCCCG TCTGCTGCCG
AAAAATGTTG AATACGCCAC TGCCCGCGAA CTTACGAAAC AACAAGCGGT AATGCAGTTC
ATTCAGCGCG GCAACACCGC GATGGAAAAG GGCCTGACTA TCGAAGCCCA AGGCGACTAT
CGCATGGCGC TGCAGATCGA CCCCGACAAC AACTTCGCGC AGCAGCAGCT CAAGAACGCG
CTCCCCGCGC TCCCATCAAC CGGTTCTCAG ATCCGCTTCA GCGATTCGAC TGACGATGCC
GAGTATCGCG CGCCGGTCAT GCTTGCTCCC GAAAAGGTAA AGAAGGACTT CCACTATCGT
GGCGACTCGA AGGGTCTGCT CACGCAGGTG ATGCAAGCCT ACGGCGTGAC CGCCACGCTG
GATGATTCCG TGCCCTCGAA GCGAGTCCGC TTCGACTTGG ATGCGACGGA TTTCGAGCAT
GCCACCGAGG CCGCAGGCAC CATCACGAAA ACTTTTTGGG TGCCACTTGG ACTGCGCCAA
GTCTTGGTAC TCGCCGACAC GCCGGCAAAC CGTCGCGACA ACCAGCACAT GGTTCTGCGC
ACCTTTTACT TCCCTGACGC GACCACGCCG ACCGACCTTC AGGACCTGAT TAACGTCTTC
CGCGTGATCT TCGACGTCCG CTTTGTGGTG CCGCAACCCA GCAGGAACAG CATTACGGTG
CGGGCCCCGC AGCCTACCAT GGAAGCCGTT ACCCAGTTCT TTTCCGATCT CGACGCCTCC
CGTCCGCAAG TTGCGCTGGA TGTCAGCGTC TATCGCGTAA GCGGTACCCT GACCCACCAG
TTAGGAGTTC AGCCGCCGAA TCAATTCACG ACGTTTAACC TCGGCAGCGT GCTCGCAGGA
CTCGGCGCCA CGAATTTGCA GTCGCTAATC AATCAGATCA TTTCTTCCGG CGCGATCAAC
CAGGCGAACG GTACCGACAT CTCGGCGCTT ATCACACAAG CCCTCGGCAA CACGCAACTG
GCCACGCTTT TCCAGACGCC GTTCGTTACC TACGGCGGAG GCCTGACGCT GATGGCACTC
ACCGTGCCCG GCACCACGCT CAATCTCAAC TTCAGTAAGG CGAATTTCCA GAACCTCGCG
CACATGCAGT TGCGAGCATC GCAGAACAAC GCGGCAACCA TGCGTATCGG TGAGCGTTAC
CCGATTCTGA ACGCGACCTT TGCGCCGATC TACAACACTC CACAAATCAG TGCTCTGTTG
CGGACGGGAA CCTACGTAGC ACCCTTCCCA TCGTTCAATT ACGAGGACCT CGGTTTGACG
GTGAAAGCCA CGCCCTCGAT CCAAGGCAAT CGCGACGTGC GCCTCAACCT CGAGATGCAG
ATGCGCTCGC TCGGCGCCGG AACCAGCAAC GGCATGCCGA TCATCAACAA CCAGGAATAC
AAGGGCACGA TCTCATTGAA AGACGGCGAG CCGGGCGTCG TCGTGAGCTA TTTGACGGAG
AGCGAATCGC GATCCATCTC CGGTATACCA GGCTTAGGTC AAATCCCCGG ACTCGGTTCG
GCCGTGGCAA GCACCGATCG CGAGGGCGTG GAATCCGAGC TACTCGTCAT CATCACGCCT
CACGTGCTCA AGGTCATCGA GCCCAAGATG GATACGATCG CCATGCCCCG GGGCACCTAG
 
Protein sequence
MRTALWGAIL LAAASVAAQD LPSPNTPAVE AKVPSQKAEK PSKSEEKQAK HEFSEGMKRQ 
KAGDLQQAYE HFEAASRLLP KNVEYATARE LTKQQAVMQF IQRGNTAMEK GLTIEAQGDY
RMALQIDPDN NFAQQQLKNA LPALPSTGSQ IRFSDSTDDA EYRAPVMLAP EKVKKDFHYR
GDSKGLLTQV MQAYGVTATL DDSVPSKRVR FDLDATDFEH ATEAAGTITK TFWVPLGLRQ
VLVLADTPAN RRDNQHMVLR TFYFPDATTP TDLQDLINVF RVIFDVRFVV PQPSRNSITV
RAPQPTMEAV TQFFSDLDAS RPQVALDVSV YRVSGTLTHQ LGVQPPNQFT TFNLGSVLAG
LGATNLQSLI NQIISSGAIN QANGTDISAL ITQALGNTQL ATLFQTPFVT YGGGLTLMAL
TVPGTTLNLN FSKANFQNLA HMQLRASQNN AATMRIGERY PILNATFAPI YNTPQISALL
RTGTYVAPFP SFNYEDLGLT VKATPSIQGN RDVRLNLEMQ MRSLGAGTSN GMPIINNQEY
KGTISLKDGE PGVVVSYLTE SESRSISGIP GLGQIPGLGS AVASTDREGV ESELLVIITP
HVLKVIEPKM DTIAMPRGT