Gene Acid345_2528 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2528 
Symbol 
ID4072172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2984787 
End bp2986394 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content60% 
IMG OID637984545 
ProductTPR repeat-containing protein 
Protein accessionYP_591603 
Protein GI94969555 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.652305 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAATTTTC GCCGCGGCCT CATTCTCGTT TTCTGCTGTT TGCTGCTGAG CCTTACTGCT 
GTTGCCCAGG CACACGCGGG GGCACAAACG CTCCTCGTCC TCCCCTTCGA CAACGCATCC
CGCGCCCCAG GCCTCGAATG GATCAGCGAG TCCTTCCCTG AGTTGCTCGG TCAGCGCATG
GCCTCCCCCT CAACCTATGT CATTAGCCGT GACGAACGCC TGCTCGCCTT CGACCGCTTC
GGCATCCCTC AAACCCTTCA TCCTTCCCTC GCCACGCTCT ATCGCATGGC CGAGCAAATG
GACGCCGACT ACGTCGTTAT CGGCCACTAC ACGTTCGACG GCAACACTTT CACCGCAAAT
GCGCAGCTTC TCGACATGAA GTCACTGAAG TTGGAGCCAT CCGTCACCGC TAGCGGCCCG
CTCACCACGC TGATGAATAT CCAGAGCACC CTCGCATGGG ACCTGATCGG CGAAATGCAA
AGGCAGCCGA CTGGCTCGAA GGACGAGTTC CTCCGCGCCT CCTCCGGCAT TCGACTCGAC
GCCTTCGAAA ACTACGTTCG TGGCATCACC GCCGGCACCC GCCAGGAGAA GATCAATCGC
CTGCGCGAGG CCAACCGCCT CAGCCCGAAT TACACCCGCG CCACGCTCGC GCTCGGCAAG
GCTTATCTCG ACAATCGCGA CTACGATCAG GCCGTCAACT GGCTCTCGCG AATTCCAAAA
AACGATCCGC TCGCAAATGA AGCCAGTTTC GATATCGGCA TCGCGGCCTT TTATCGTGGC
GACTTCGAAC GCTCCGCTGA GGCCTTCAAT TTCCTGCTGA CCCGTTTGCC CATGCCCGCG
ATCTACAACA ACCTTGGTGT CATCGCCGCC CGGCGCGGCC GGAAAACCGA AGCAGACCTG
CTGCAGAAAG CTGTCGCTGC CGATCCCACC GACGCCGACT ACCGTTTCAA CCTCTCTGTT
GCGCTCGCCC GTGGCGGCGA CAACGCCGGC GCAGTGCGAC AACTTCGCGA CGCCCAAAAG
ACCCATCCCG ACGACGCAGA GATCAAGTCT CTTCTGGATC AGCTTCAAGG CGCGGCAGTC
TCCAACGTTT CGCACACCCA GGCAGCGCAG CTCAAACTCC CTATGGAGCG CCTGAAACGC
ACCTACGACG AAACTTCGTA TCAGCAAGTG GCCATGGAAA TCGAGAACGT CGCTGAGCAG
CGCCTCGCGC AGGCCGATCC CAAGACTCAC GCTGCCTACC ATCTCGAACG CGGACGCGAT
CTGTTGAACC AGGGCTTCGC CGCGCAAGCC GAAAAACAAT TCCGCGAGGC GTTGCAATAC
GATCCAAACA ATGCCGTTGG TCATGCAGGC TTGGCACGCT CGCTTGAATC GAGCGACCCG
GCGGCCTCCG CGCGCGAAGC CGACGCATCC CTCAAACTCC AGTCGAATGT CGACGCCTAC
CTAGTCGTGG CACGCTTGGC CGTCGCCCGC AAAGACACCC GCAAAGCGAA TGACGCTGTC
GATTCGGCCT TGAAACTTGA GCCTGCGAAT TCCGCCGCAC TGGCACTCAA ACGAAGCATA
GAATCAAAGG CGGGTCAGGC CCCGCCATCC GGCAGCACGC CGGAGTAG
 
Protein sequence
MNFRRGLILV FCCLLLSLTA VAQAHAGAQT LLVLPFDNAS RAPGLEWISE SFPELLGQRM 
ASPSTYVISR DERLLAFDRF GIPQTLHPSL ATLYRMAEQM DADYVVIGHY TFDGNTFTAN
AQLLDMKSLK LEPSVTASGP LTTLMNIQST LAWDLIGEMQ RQPTGSKDEF LRASSGIRLD
AFENYVRGIT AGTRQEKINR LREANRLSPN YTRATLALGK AYLDNRDYDQ AVNWLSRIPK
NDPLANEASF DIGIAAFYRG DFERSAEAFN FLLTRLPMPA IYNNLGVIAA RRGRKTEADL
LQKAVAADPT DADYRFNLSV ALARGGDNAG AVRQLRDAQK THPDDAEIKS LLDQLQGAAV
SNVSHTQAAQ LKLPMERLKR TYDETSYQQV AMEIENVAEQ RLAQADPKTH AAYHLERGRD
LLNQGFAAQA EKQFREALQY DPNNAVGHAG LARSLESSDP AASAREADAS LKLQSNVDAY
LVVARLAVAR KDTRKANDAV DSALKLEPAN SAALALKRSI ESKAGQAPPS GSTPE