Gene Acid345_1099 details


Gene Information

Locus tag: Acid345_1099
Symbol:
ID: 4069559
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1376150
End bp: 1377247
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 57%
IMG OID: 637983108
Product: peptidase M48, Ste24p
Protein accession: YP_590176
Protein GI: 94968128
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
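
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the gene information fields above.
start_bp = 1376150
end_bp = 1377247
gene_length_bp = 1098
protein_length_aa = 365

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, remainder, residues)  # 1098 0 365
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop matches the listed protein length.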


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000343868
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000201004
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCAAGCTC CCAGAGTTCG TCTGCTGATC ACGCTCGTCT TCTTTGTCAG TTTTTCAGTT 
GCGCAAACTC TTAACAGTTC CAATGCAAGT GCGGCGCCGG TGGTAGAGGT GTCGAGCGCG
AAGTATGCGA AGGCGATGCA GAAGGTAGTG ACAAAGTACG ACGTCACAAA GATCGGCGAA
CGCAAAGTTG CGGGCGGGAT GAACTTTGTC TCCATTGAAG CGGAGGCCAG GCTCGGGCGG
CAGTTGTCGG GCGAAGCGGA CCGCATGCTG CGGTTGGTGC AGGATCCGGT CATTACTGAG
TATGTGAATC GGCTGGGGCA GAACCTGGTG CGCAATTCGG ATGCGAAGGT GCCGTTCACA
ATCAAAGTTG TGGATTCGGA AGAGATCAAC GCATTCGCTC TGCCAGGCGG ATATTTCTAC
GTGAATACCG GGCTGATTCT TGCCGCGGAT AGCGAAGCGG AGTTGGCGGC GGTGATGTCG
CACGAGATTG CGCACGTGGC GGCGCGTCAT GCGACGAAAA ATTTAAGCAA GAGGGAACTG
CTGCAGCTGT GCACACTGCC CACTTTCTTT ATCGCTGGGC CGGCGGTGAT CGCGATACGA
GAGGCGGCCC AGATCGCGTT ACCGATGACG TACATGAAAT TCTCGCGCGA TGCCGAACGC
GAGGCCGATT TGCTTGGCAT GGAGTATGCA TACGCTTCTG GGTACGATCC GCAGGCCATG
GTGACGTTCT TCCAGAAGGC GCTGGTCAGG GACCAGAAGA GGCAGCGGTT GATTGCGCGG
GCGTATGCGA CGCATCCGAT GACGGCGGAG CGCATGCAGC GCGCGCAGGC GGAAATCCAG
ACGTTGTTGC CGCCGAAAGA CAACTACATG CTGACGACCA ACGAATTCGA TGAGATCAAA
GCACGTGTGT CGCGACTGGA GAGGAACCAA CTGGTAGCCT GGGCGCCGAG CAGCAAACCA
ACGCTGCGGA ACCGGACCGA TGTGGAGAGT GCTCCGGTCA GCAATACACC GACTTTGAGA
AAAACTGTTG GTGACGGAAC CAATTTCACC GATGTTAAGA AGGACGTCCG CACGTATGCG
GAGCGGCAGT GGAACTAG
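
A GC content figure like the one in the gene information can be recomputed from a sequence in this layout. The sketch below is one plausible way to do it, not necessarily the method the database used: `gc_percent` is a hypothetical helper that ignores the spaces and line breaks separating the 10-base blocks and counts only A, C, G, and T.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq."""
    # Keep only nucleotide letters; this drops block spaces and newlines.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 3 G + 3 C out of 10 bases.
print(gc_percent("GTGCAAGCTC"))  # 60.0
```

Passing the full gene sequence as one multi-line string gives the whole-gene value, which can then be compared with the rounded percentage listed in the record.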
 
Protein sequence
MQAPRVRLLI TLVFFVSFSV AQTLNSSNAS AAPVVEVSSA KYAKAMQKVV TKYDVTKIGE 
RKVAGGMNFV SIEAEARLGR QLSGEADRML RLVQDPVITE YVNRLGQNLV RNSDAKVPFT
IKVVDSEEIN AFALPGGYFY VNTGLILAAD SEAELAAVMS HEIAHVAARH ATKNLSKREL
LQLCTLPTFF IAGPAVIAIR EAAQIALPMT YMKFSRDAER EADLLGMEYA YASGYDPQAM
VTFFQKALVR DQKRQRLIAR AYATHPMTAE RMQRAQAEIQ TLLPPKDNYM LTTNEFDEIK
ARVSRLERNQ LVAWAPSSKP TLRNRTDVES APVSNTPTLR KTVGDGTNFT DVKKDVRTYA
ERQWN
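
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code, and treats only ATG, GTG, and TTG as initiators that are read as methionine in the first position. That is a simplification of the start codons table 11 allows, and `translate_cds` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (first base slowest, third base fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a simplified set of bacterial initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spaces and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        # An initiator codon in first position is read as methionine.
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene: the GTG start is read as M.
print(translate_cds("GTGCAAGCTC CCAGAGTTCG T"))  # MQAPRVR
# Last 18 bases of the gene: five residues, then the TAG stop.
print(translate_cds("GAGCGGCAGT GGAACTAG"))      # ERQWN
```

Both fragments agree with the corresponding ends of the protein sequence listed above.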