Gene Acid345_4664 details

Gene Information

Locus tag: Acid345_4664
Symbol:
ID: 4070709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5521295
End bp: 5522875
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 57%
IMG OID: 637986704
Product: peptidase M28
Protein accession: YP_593738
Protein GI: 94971690
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
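
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted as a residue. Neither assumption is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 5521295
end_bp = 5522875

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1581 526
```

Under these assumptions the results agree with the listed gene length (1581 bp) and protein length (526 aa).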


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.062927
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCG AAATTCCGTC TTGCACCCTG TCAGTATATT TAGAATCTCT TACAGCTGGC 
GGTCAAGAGC AACGCCAAAT TTTACGCGCC CTTGCTCAGG GCTGCGGAAA CGGCATCCAA
CCGGATGTTT GCCGCGGAAA GATTCGAATG TCCTTCGTAT CTCGATTGAA CTCTGCATGT
CGAACCAGCG TTTCTTTGAT GCTTGCCAAC GCTATTGGTT GCGGGCTTGT GATGTTCAGC
GGCGTATCCG CCGTTGCCCA GTCCAAGCCA GTGAAGAAAC CATCTCAAGT ATCGCCGGAA
ATCCAGGCAA TACTGCGCGA TGTTTCAGCA AAGCAGATCG AGGCCAACAT CAACAAGCTC
GTAAGTTTCG GCAACCGTTC CACGCTTTCG TCGGATGTAC CTGCCGATAG CGGTAAAGGC
ATTACCGCGG CGCACGAATG GATCAAGAGC GAATTCGAGC GCTACTCGAA AGAATGCGGC
GGTTGTCTCG AGGTAAAGAC GGACGACTTC ACCGAAAGTC CCATGGACCG CATCCCGAAG
CCCACACAAA TTACCAATGT TTACGCGGTT CTGAAGGGCA CGGACCCGGC GAATGCCGAC
CGCATCGTGC TCGTCACCGG ACATTACGAT TCACGCAATT CGACGAACGA GAACACCACG
GATCCCGCGC CAGGAGCCAA TGACGATGGC AGCGGAACGG CGGTTTCGCT CGAATGTGCG
CGAGTGCTGA GCAAGCACAA GTTTCCGGCG ACGATCATCT TCCTCACCGT CGCGGGCGAA
GAGCAGGGAC TGAATGGCAG TAAACACTTC GCCAAGATGG CGCGCGCGCA AGGCTGGCAG
ATCGAAGCAG CGTTGAATAA CGACATCGTC GGCGGCAACA AAACACCGGG CGATACGACG
CAGAACCCGC ACACGGTGCG GGTGTTCTCC GAAGGCGTTC CCGCAAACGC TACGGAAGCC
GACCTCAGGT TGATCCGCGC CACTGGTACG GAGAATGACT CGCCGTCGCG CGAACTGGCG
CGCTACGTCG GTGAAGTGGG CAAGGCCGAC TTGCCGAAAA CCTTTCAACC GACACTGATC
TACCGCCGCG ATCGCTTCCT CCGCGGTGGG GACCATAGCA GCTTCAATAT GGAAGGATTT
GCCGCCGTCC GGTTCACGGA ATGGCGTGAG GACTTCCACC ATCAGCACCA GAACCTGCGG
ACGGAAAACG GCATCGAATA TGGTGATTTG CCGAAGTTCG TGGACTTCGA GTACGTTGCC
AACGTCGCGC GGTTGAATGC CATTACGCTG GCTACACTGG CGATGGCGCC TGCTCCTCCA
GACAATGCCC GGTTACAGAC CAAAGACCTG GAGAACGGCA CCACGATTAC GTGGAAGCCC
TCGCCGGGCG GGCTCGCGAC GGGCTACGAG GTGGTTTGGC GCAACACTTC ATCGCCCGAT
TGGGAAGAAT CGAAGGGCTT CCCAGCCGAT GCGAACAGCG CCAAGATCGA CGTGTCGAAA
GACAACGTCA TTTTCGGGAT CCGCGCCGTG GGCAAGAACG GGCTGAAGAG CCCAGTCGTA
ATTCCTCCGC CGGAGAGGTA G
 
Protein sequence
MSREIPSCTL SVYLESLTAG GQEQRQILRA LAQGCGNGIQ PDVCRGKIRM SFVSRLNSAC 
RTSVSLMLAN AIGCGLVMFS GVSAVAQSKP VKKPSQVSPE IQAILRDVSA KQIEANINKL
VSFGNRSTLS SDVPADSGKG ITAAHEWIKS EFERYSKECG GCLEVKTDDF TESPMDRIPK
PTQITNVYAV LKGTDPANAD RIVLVTGHYD SRNSTNENTT DPAPGANDDG SGTAVSLECA
RVLSKHKFPA TIIFLTVAGE EQGLNGSKHF AKMARAQGWQ IEAALNNDIV GGNKTPGDTT
QNPHTVRVFS EGVPANATEA DLRLIRATGT ENDSPSRELA RYVGEVGKAD LPKTFQPTLI
YRRDRFLRGG DHSSFNMEGF AAVRFTEWRE DFHHQHQNLR TENGIEYGDL PKFVDFEYVA
NVARLNAITL ATLAMAPAPP DNARLQTKDL ENGTTITWKP SPGGLATGYE VVWRNTSSPD
WEESKGFPAD ANSAKIDVSK DNVIFGIRAV GKNGLKSPVV IPPPER
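
The relationship between the gene sequence and the protein sequence can be sketched in Python. This is a minimal illustration, not the tool used to produce this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments; table 11 differs in which codons may act as start codons, and this sketch does not handle alternative starts (this gene begins with ATG, so that does not matter here).

```python
from itertools import product

# Codon assignments of the standard genetic code, with codons enumerated in
# T, C, A, G order. Assumption: these are sufficient for table 11 when the
# gene starts with ATG, as this one does.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; a stop codon is shown as '*'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and the final line of the gene sequence above.
head = "ATGAGCCGCG AAATTCCGTC TTGCACCCTG"
tail = "ATTCCTCCGC CGGAGAGGTA G"

print(translate(head))  # MSREIPSCTL
print(translate(tail))  # IPPPER*
```

The two fragments reproduce the first ten and last six residues of the protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content listed in the record.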