Gene Acid345_2940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2940 
Symbol 
ID4070864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3483561 
End bp3485219 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content58% 
IMG OID637984959 
Productpeptidase M28 
Protein accessionYP_592015 
Protein GI94969967 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.529928 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCATTCAC ACGTACCTCG TGTCCCCTAT CGCATCTCCC TATTTGTGTT TCTCGCATTG 
TTTTCCTGCT CTGCATACGC TCAACAGCGC GGCGAGACCG TGGACCTGGG CATGGTCACG
CAGATTCGGC AGGAAGGATT CCGGAACTCG CAAGTCATGC AAACCGCCAG CGCAATCGTC
GACGGTATCG GCGAACGGCT AAGTGGGTCG CCGAATGTCA AGAAAGCGAA CGAATGGACC
CGGGATCAAT TCACCAAGTG GGGGTTGCAA AACGCTCACC TTGAGGGCTA CAAGTTCGGG
CGCGGGTGGC AGAACGAATT CACGTCGGTC CGAATGGTCT CGCCCGACTT CATGGAACTT
ATTGCCTACC CGAAGGCCTG GACGCCGGGA ACGAGTGGCG CGATTAAGGC GCCGGCCGTG
CGCGTAGTCG CGAAGTCACC TGCCGACTTC GAAAAGTATC GCGGCAAGCT CTCCGGCAAA
ATCGTTCTCT ATGGCGATAT GCCGGAAGTG AAGCCGCAAG CGGAAGCCGC CATGAGCCGT
TATGACGAGA AGAAACTCGC CGACATCGGC CATTACGAGA TCCCAAGCGA GAAGCCGCGA
TTCAGTCCTG AGGAATTTAA GAATCGTCTA GCGCTACGGA AAGCGGCGGA CGAGTTCTTC
GTCAAGGAGA ATGTTGTCGC TGTGATTGAC GCCAGCCGCG GCGATGGCGG TACCGTTTTC
GTACAGAGCG CCGGTTCGTA TAAAGAAGGT GAGCCGGAGG TCGCGCCGTC GCTCTCGATG
GCGGTGGAAC ATTTCGGCCG CATCGCGCGA CTGCTGGAGC GCGGAGGAAA TGTCGAACTT
GAAGTAAATG TTCAGAACAA GTTCTACACA GACGACCCGA CGGCTTACGA CACCGTCGCC
GAAATCCCCG GCTCTGACAA GAAAGATGAG CTGGTAATGG TCGGAGCGCA CCTCGATTCC
TGGCATGCCG GCACCGGTGC CACGGACAAT GCTGCCGGAT GCGCGGTGAC GATGGAAGCA
GTGCGCATTT TGCAGGCACT TGGCGTGAAG CCGCGGCGGA CTATCCGTAT CGCACTATGG
ACAGGCGAGG AAGAGGGATT GCTCGGCTCG CGCGCGTATG TCGAGCAACA CTTTGGGTCG
CGACCGGAGA GCACTGATCC GAAGGAAAAA GACCTGCCGT CGTTCCTGCG CAAGCCGGGA
ACTCCACTCA CCCTCAAGCC GGAGCAGAAG CAGGTTTCGG CATACTTCAA CATCGACAAT
GGAAGCGGTA AGGTCCGCGG CATTTACCTG CAGGAGAACG CCGCCGTTGC ACCGATCTTC
ACCGAGTGGC TAAAGCCATT CCATGATCTC GGCGCCGATA CGATCACGTA TCGCAACACG
GGCGGCACGG ACCACCTCTC GTTCGATGCG GTAGGTATTC CGGGTTTCCA GTTCATCCAG
GACCCGATTG AGTATGAGAC TCGCACTCAC CACTCGAATA TGGATGTGTA CGAACGCCTG
CAACGTGACG ATCTGATGCA AGCTTCGGTA GTACTCGCAT CGTTCATCTA TAACGCAGCA
ATGCGCGACG ACATGATGCC GCGCAAGCCG CTACCGAAGG ATGCGGTGCT CCCGCCCGCA
TCCGCGGCTC CGGCAAAGAC CCCTGCAAAG AAAAAGTAG
 
Protein sequence
MHSHVPRVPY RISLFVFLAL FSCSAYAQQR GETVDLGMVT QIRQEGFRNS QVMQTASAIV 
DGIGERLSGS PNVKKANEWT RDQFTKWGLQ NAHLEGYKFG RGWQNEFTSV RMVSPDFMEL
IAYPKAWTPG TSGAIKAPAV RVVAKSPADF EKYRGKLSGK IVLYGDMPEV KPQAEAAMSR
YDEKKLADIG HYEIPSEKPR FSPEEFKNRL ALRKAADEFF VKENVVAVID ASRGDGGTVF
VQSAGSYKEG EPEVAPSLSM AVEHFGRIAR LLERGGNVEL EVNVQNKFYT DDPTAYDTVA
EIPGSDKKDE LVMVGAHLDS WHAGTGATDN AAGCAVTMEA VRILQALGVK PRRTIRIALW
TGEEEGLLGS RAYVEQHFGS RPESTDPKEK DLPSFLRKPG TPLTLKPEQK QVSAYFNIDN
GSGKVRGIYL QENAAVAPIF TEWLKPFHDL GADTITYRNT GGTDHLSFDA VGIPGFQFIQ
DPIEYETRTH HSNMDVYERL QRDDLMQASV VLASFIYNAA MRDDMMPRKP LPKDAVLPPA
SAAPAKTPAK KK