Gene Acid345_2595 details


Gene Information

Locus tag: Acid345_2595
Symbol:
ID: 4070558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3063546
End bp: 3064559
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 60%
IMG OID: 637984612
Product: peptidase M50
Protein accession: YP_591670
Protein GI: 94969622
COG category: [R] General function prediction only
COG ID: [COG1994] Zn-dependent proteases
TIGRFAM ID:
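
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; both assumptions are consistent with the values listed here, but they are not stated by the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3063546, 3064559)
print(length_bp)                  # 1014
print(protein_length(length_bp))  # 337
```

Both results agree with the Gene Length (1014 bp) and Protein Length (337 aa) fields.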


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCAGCA CTTCACTCGG ACCAGCTTCG AACTGCCGGC GATGCGGCTC ACCGTTATCG 
CCGGGCGCGC TGGAGTGTCT CCAGTGTCAT GCGTTCGTGC ATGCCGACGA GCTCACCCGG
ATCTCAAACG AAGCGAAGCA GCTGGAGAGT GCCGGCGATT TGCGCGCGGC GCGCGAGAGA
TGGCTAACGG CAATTCCGTT ACTTCCAGTG GATGCACAAC AGACGGCGTG GATCAACAAC
CATGCCGTGG ATTTGTACGC ACAGGCGCTG GAGGCCGACG TACAGCGACA GCAGGAGGGC
GAGTCCAAGA GCAAATGGGC TAAGCGGCTC GGACCGTTTG CACCCCTGTT GGTGCTGCTT
GGGAAAGCGA AAGTGCTGTT CGGACTGTTG AAGCTCAAGT TCGTCCTGAG CTTCGCCGGC
TTTATATGGT TCTACTGGAC GATTTTCGGG ATGTGGTTCG GCGTCGGATT CGCGGTGCTA
ATCCTCTGTC ACGAGATGGG GCACTACATC GAGGTGAAGC GGCGCGGACT TCCTGTGGAA
GTGCCGGTGT TTCTACCGGG CCTTGGCGCA TATGTGAAGT GGAAGAACCT CGGCGTGTCC
GGGGAAGGGC GGGCGATGAT CAGCCTGGCC GGTCCGCTGG CGGGATTTCT ATCGTCGGCG
GTGTGCATTT TGGTTTATTG GCAGACGCAC TCGAAGCTGT GGCTGGCGCT GGCGCACTCG
GGAGCCTGGT TGAACCTGAT GAACCTGATT CCGGTGTGGG CACTGGACGG GGCTCAGGCG
ATCCAGGCGG TGAGCCGTTA TGGCAGGGTG GTGCTGTTGT TGAGCTGTGC GCTTCTATTC
TGGGGAACAC GCGACTATGT GTTGTTGTTT ATTGGAGCCG GAGTATTGTG GCAAGCGTTT
GTGAAGCCGA CCGACGAGAC GAGCGCACGG ATTGCGGCAT ACTTCACGCT GCTGCTCTGC
GCGTTGGCGC TGATTCTGGT GCTAGCGCCT GGACGAGGGT TCGGGGCGGA GTAG
 
Protein sequence
MSSTSLGPAS NCRRCGSPLS PGALECLQCH AFVHADELTR ISNEAKQLES AGDLRAARER 
WLTAIPLLPV DAQQTAWINN HAVDLYAQAL EADVQRQQEG ESKSKWAKRL GPFAPLLVLL
GKAKVLFGLL KLKFVLSFAG FIWFYWTIFG MWFGVGFAVL ILCHEMGHYI EVKRRGLPVE
VPVFLPGLGA YVKWKNLGVS GEGRAMISLA GPLAGFLSSA VCILVYWQTH SKLWLALAHS
GAWLNLMNLI PVWALDGAQA IQAVSRYGRV VLLLSCALLF WGTRDYVLLF IGAGVLWQAF
VKPTDETSAR IAAYFTLLLC ALALILVLAP GRGFGAE
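
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be processed, the code below strips the whitespace, computes the GC fraction, and translates a nucleotide sequence using the codon assignments of translation table 11 (the bacterial code, which shares the standard amino acid assignments and allows alternative start codons such as GTG that are read as methionine at the first position). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and are not part of any database tool.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses the
# same amino acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove all whitespace (block gaps, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence (20 nt, six complete codons).
print(translate("GTGTCCAGCA CTTCACTCGG"))  # MSSTSL
```

The output matches the first six residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the GC content field.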