Gene Acid345_1442 details

Gene Information

Locus tag: Acid345_1442
Symbol:
ID: 4071631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1740708
End bp: 1741961
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 60%
IMG OID: 637983451
Product: peptidase M20
Protein accession: YP_590518
Protein GI: 94968470
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID:
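
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1740708
end_bp = 1741961
gene_length_bp = 1254
protein_length_aa = 417

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                      # coordinate span matches gene length
print(expected_protein_length == protein_length_aa)   # codon count matches protein length
```

Under these assumptions both comparisons hold for this record: the span is 1254 bp, which is 418 codons, or 417 residues plus one stop codon.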


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGCAG TCGAACCACT TCGGGTAGAG GCCACAAGCA AGCGCGGTCG GTTAAGCGAG 
ACCCAATGGG CGCGTTCCGC CCAGCAGCGC GTCGTTCGCC TTGCCCAATT ACAGGTGGTG
CACGACGCCT TCGACTGGCT TCACCGCAAG GAATTTCAGC TTCAGGAACG CCAGATTGAG
CTCGCAAAAA TTCCCGCTCC TCCGTTTGGT GAGGCCGCCC GTGCAGACTG GCTACAGTTC
CGCTTCCGCG AGCTGGGATT GAAAGACATT CATACCGACG CCGTCGGTAA CGTTCTTGCC
ATCCTGCCCG GCGCCAACAC CTCTGCCGGC TACGTCTCGA TCAGCGCGCA CCTTGACACC
GTTTTCCCTG CCGAGACCAA ACTCGATATC AAACGTGATC GTGACCGCAT GGTTGGTCCC
GGCATTTCGG ACAACGCCGC CGGCGTAGTA GCCATGCTTG CCATCGCTGG CGCCATCAAA
GACGGCCGCA TTCGCCCCTC AGCGCCAATC CTTTTTATCG GCAATGTTGG GGAGGAAGGC
GAAGGCAATC TCCGCGGCGT CCGCCATATT TTCGACGACA CGCGCTGGAA GGACGCTATC
GAGTTCTCCA TCATCCTCGA CGGTGCAGGA ACCGATTCGA TCATCGCGGA AGGTCTTGGC
AGTCGTCGCT ATGAAGTCCG CGTTAATGGC CCTGGCGGCC ATTCCTGGAG CGATTTCGGA
GCACCCAACC CGATCGTCAT ACTTTCCAAG ATCATCCAGG AATTCAGTGC CACTTCCCTT
CCCTCGAAAC CGAAAACTAC CTACAACATC GGCGTGATCC AGGGCGGCAC TTCCGTCAAC
TCCATCCCGG AGATGGCCTG CATGCGCGTG GACTTGCGGT CGGTAGCAGT CGCCGAAATC
GAAAGATTGG AAGAAGCTCT GCGCCGCGCC GCGGACTCCG CAAAGACCCG CGACGTGGAA
ATCGAAATCC GCACCATCGG TGATCGTCCT GCCGGCGAAC TCAAAAGCGA TTCGCCCATG
CTGGCCATGG TCCGCGCCGT AGACGCGCAT CTCAGCATTA CCTCACGCAT CCAGCGCGCC
TCCACGGACG CCAACATCCC GCTCTCACAA GGACGCGAGG CGATCGCACT CGGCGCAGGT
GGAACCGGAG CCGGCGCCCA CACTCTTCAC GAGTGGTTCG ATCCCTACGG ACGCGAAGTA
GGCTTGAAGC GTATCCTGCT TCTGACGCTG GCTCTCACCG GCATCGACGA GTAA
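
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined into one string before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, copied as displayed.
first_line = "ATGTCGGCAG TCGAACCACT TCGGGTAGAG GCCACAAGCA AGCGCGGTCG GTTAAGCGAG"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases on that line
print(round(gc_content(seq), 1))   # GC percentage of that line only
```

Pasting the full sequence into `clean_sequence` in place of `first_line` would give values to compare against the reported 1254 bp and 60%.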
 
Protein sequence
MSAVEPLRVE ATSKRGRLSE TQWARSAQQR VVRLAQLQVV HDAFDWLHRK EFQLQERQIE 
LAKIPAPPFG EAARADWLQF RFRELGLKDI HTDAVGNVLA ILPGANTSAG YVSISAHLDT
VFPAETKLDI KRDRDRMVGP GISDNAAGVV AMLAIAGAIK DGRIRPSAPI LFIGNVGEEG
EGNLRGVRHI FDDTRWKDAI EFSIILDGAG TDSIIAEGLG SRRYEVRVNG PGGHSWSDFG
APNPIVILSK IIQEFSATSL PSKPKTTYNI GVIQGGTSVN SIPEMACMRV DLRSVAVAEI
ERLEEALRRA ADSAKTRDVE IEIRTIGDRP AGELKSDSPM LAMVRAVDAH LSITSRIQRA
STDANIPLSQ GREAIALGAG GTGAGAHTLH EWFDPYGREV GLKRILLLTL ALTGIDE