Gene Acid345_1420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1420 
Symbol 
ID4068799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1719789 
End bp1721129 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content58% 
IMG OID637983429 
Productpeptidase M50, putative membrane-associated zinc metallopeptidase 
Protein accessionYP_590496 
Protein GI94968448 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0750] Predicted membrane-associated Zn-dependent proteases 1 
TIGRFAM ID[TIGR00054] RIP metalloprotease RseP 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.130402 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGTT TTCTCATAGC GATTGTGTCG ATCGTGTTCG TACTCGGGGT GTTGGTTCTC 
GTTCACGAAT TCGGTCATTT TGCCGCTGCC AAATTGTTTG GCGTTCGCGT GGAGACATTC
TCCATCGGGT TCGGAAAGCG GCTCGTCGGT TTCCGGCGCG GTGAGACCGA TTACCGCATC
AGCGCTCTCC CGCTCGGCGG GTACGTGAAG ATGACTGGCG AAACTCCCCT GGACAGCCGT
ACCGGCGCGC CCGAGGAGTT CATGTCTCAT CCGCGATGGC AGCGGATCAT CATCGCCCTC
GCCGGCCCGT TCATGAATAT CGCGCTCGCC ATCGGACTGC TGACCGTGGT CTACATGGTG
CACGACGAAG AACCGGCATT CTGGGGAGAA AAAGCGACGG TGGGGTTCGT TGCTCCGGGT
AGCACCGCGG ACAAGGTCGG CGTGAAAGCC GGCGACACGA TCGTAAAAAT CGCCAACGTC
GATAATCCGA CTTGGGAAGA CGTTTATCTC CAGACCTCCA CGAGTCCGGG AGCGGCGGTA
CGGCTCGATC TCCTGCGCGA CCGGCAGGTG ATTGCGACTT CGGTTGTCCC CGAGAAGGGG
TCTGAAGAGC AGGGCAGCAA TCCAGGTTGG ACTCCGTACA ATCCGAACAC CGTTGCCAGT
CTTGAAGCGG AGATGCCTGC TGCGAAGGCT GGCATCAAAG TCGGCGATTC GATTACGGCG
ATCGATGGCG CACCGATCTA CTCGACAGAG TCGATGATCG CCATGCTGCA ACAGACAAAG
GAAAAGCCGG TCGAGCTGAC TGTGCAGCGT GACGGCAAGG AATTTAAGGT TACTGTGACG
CCGCAGCTGA CCAACGACAA GGGCGAGAGC CGATATCGCA TCGGCATGGT ATCGGAGCCG
AAGTACATCT CGCTTCACTT GCCGTTCAAG GCTGCGCTGT CGAAATCTCT CGACGAAAAC
CGGAAGTTCT CATTCCTCGT CGTTGACCTG GTCAAAAAGC TGGCGCGGGG CGCAGTTTCG
ATTAAGACGA TGTCGAGCCC GATCGGCATG GCGAAGGCAT CCGGCGAAGC AGCTCGGCAG
CCCGGTTGGT CGCCATTGAT GCGGATGATG GCGCTCTTCA GCTTGCAGCT TGGGATCTTC
AACTTGTTCC CGATCCCCAT CCTCGATGGC GGCATGATCC TGATGCTATT GATCGAAGGC
CTTATGCGCC GCGACATCAG CATGCGAATC AAGGAACGCG CCTATCAAGT CGCGTTTGTC
TTCCTGATGC TGTTCGCAGC CGTGGTCATC TTCAACGACA TTGTGAAGAC GGTTCCCGGG
CTACACCAGC ACTTGCCGTA A
 
Protein sequence
MEGFLIAIVS IVFVLGVLVL VHEFGHFAAA KLFGVRVETF SIGFGKRLVG FRRGETDYRI 
SALPLGGYVK MTGETPLDSR TGAPEEFMSH PRWQRIIIAL AGPFMNIALA IGLLTVVYMV
HDEEPAFWGE KATVGFVAPG STADKVGVKA GDTIVKIANV DNPTWEDVYL QTSTSPGAAV
RLDLLRDRQV IATSVVPEKG SEEQGSNPGW TPYNPNTVAS LEAEMPAAKA GIKVGDSITA
IDGAPIYSTE SMIAMLQQTK EKPVELTVQR DGKEFKVTVT PQLTNDKGES RYRIGMVSEP
KYISLHLPFK AALSKSLDEN RKFSFLVVDL VKKLARGAVS IKTMSSPIGM AKASGEAARQ
PGWSPLMRMM ALFSLQLGIF NLFPIPILDG GMILMLLIEG LMRRDISMRI KERAYQVAFV
FLMLFAAVVI FNDIVKTVPG LHQHLP