Gene Acid345_2328 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2328 
Symbol 
ID4071482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2757421 
End bp2759514 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content55% 
IMG OID637984344 
Productoligopeptidase B 
Protein accessionYP_591403 
Protein GI94969355 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0142443 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTCG CTCTTCCGCT ACTATTGTTA GTGATGACGA ACTTGTCCCC CGCTCAAACC 
GCTACGCCGC CCGTCGCCAA GAAAGAACCC AAGACCACCG AGATCAACGG TCACACCCTC
ACCGACAATT ATTACTGGCT GCGCGATAAG CCGAATCCGG AAGTGCGCGC GTATCTTGAG
GCCGAGAACG TGTACACCGA TTCGGTGATG AAGCCGACCG AGCCGCTGCA GAAAAAGCTC
TATGACGAGA TGCTGGGCCG CATCAAGGAA ACCGATGTGG ATGTTCCCTA TCCGCAGGAT
GGTTACTTCT ATTATTCGCG GACGGAAGCC GGCAAGCAGT ATGCGATCCG ATGCCGGAAA
AAGGGCTCGC TGGATGCGCC GGAGCAAGTC GTCATTGACG TGAACGAACT GGCTAAAGGA
AAGAAGTTCA TGTCGCTGGC GGAATTCGAG CCGAGCGACG ACGGCAACCT GCTGGCGTAC
TCGACCGACA GCACCGGCTT CCGGCAATAC GATTTGTATG TGCGCGACAT GCGCACCGGC
CAGGATCTTC CGGACCACAT TGCAAAAACC GGATCGATCG CGTGGGCGAA CGACAATAAG
ACGATTTTCT ACACGGTCGA GGATTCTGCG AAGCGGCAAT ATCGCGTGTA CAAGCACACG
GTGGGTAACA CCGGCGCCGA CGATCTGATC TACGAAGAAA AAGATGAGCG CTACGACGTC
TATGTATGGA AGTCGCGGAG CAAGGGATAC ATTTTCCTGA GTTCGGCGAG CCATACGACG
AGCGAAGCAA AGTACATCCG CGCAGACCAA CCGAACGCGG AGTGGAAGCT GATTGAGCCC
CGCAAGCAGG ACGTGAAGTA CACTCCTGAC CACAATGGAA AGTTCTTCTA TCTAACTGTG
AACGACACCG GGGTCAACTA TCGGCTGGTC AAGACGCCGG TGGACAATCC CGGCAAAGCG
CATTGGCAAG AAGTGATTCC GCAGCGCGCG GAGACCATGC TCAACCGCGC GGAATTCTTC
AAGGACTTCT ACGTGTTGCA TACGCGCGAG AAGGGCCTGC CGCTGCTGCG CGTGGTGGAT
ATCACAAGCG GCAAATCGAA GGACATTACG TTCCCCGAGC CCGCGTACAA CACCATGAGC
TACATGAACG CGGACTACGA CACCAAGAAG TTCCGTTACA TGTACACATC GTTCATCACG
CCGGTTTCGA CATATGAATA CGACGTGACC ACCGGTGAGT CGAAGCTGCT GAAACAGCGA
GAGGTGCCTA ACTACGACGG CACCAAATAC AAAGTTGAGC AGTTGTATGC GCCCGCACGA
GATGGCGAAA AAGTTCCGGT CTCAGTGTTG TACGCAAAGA CAACGAAGCT CGACGGCAAA
GAGCCACTCT ACCTCTATGC TTACGGTTCT TATGGGGCCT CGATTGACAT CAACTTCAAC
TCGAATTTCT TCTCGTTGGC GGACCGCGGC GTAGTTGTGG CGATTGCGCA CATTCGCGGC
GGCGGCGAAA TGGGCAAGGC CTGGCACAAT GCCGGTCGCA TGATGAACAA GAAGAACACG
TTCAATGACT TTATTGACAG CGCTGAGTAT CTCCTGAAGA ACAACTACGG CACCAAGGAC
AAGTTGGTGA TCGAGGGCCG CAGCGCCGGT GGCTTGCTGA TGGGCGCCGT GCTGAATATG
CGTCCGGATC TCTTCCATGC TGCGATTGTC GGAGTGCCGT TCGTTGATGT AATCAACACC
ATGCTCGACG AGTCGTTGCC GCTCACGGTC GGCGAGTTTG AAGAGTGGGG CAATCCGAAG
GAAAAAGCCG CCTTCGACTA CATGTATTCC TACTCGCCTT ACGACAACAT CGATGCGAAG
CCCTATCCCA ATATGCTGGT GAAGACAAGC TTCAACGACA GCCAGGTGAT GTATTGGGAG
CCCGCTAAAT ACGTAGCGAA AATGCGTGCG CTGCGAAAAG ACGATCACCT CGTAATTCTG
AAGACGAACC TTTCGCCCGC GGGGCACGGC GGATCCAGCG GACGCTACGA TCGCATCCAC
GAATTCGCCT TCGACTATGC TTTCATCCTC ACGCAGATGG GAATCAACGA ATAG
 
Protein sequence
MRLALPLLLL VMTNLSPAQT ATPPVAKKEP KTTEINGHTL TDNYYWLRDK PNPEVRAYLE 
AENVYTDSVM KPTEPLQKKL YDEMLGRIKE TDVDVPYPQD GYFYYSRTEA GKQYAIRCRK
KGSLDAPEQV VIDVNELAKG KKFMSLAEFE PSDDGNLLAY STDSTGFRQY DLYVRDMRTG
QDLPDHIAKT GSIAWANDNK TIFYTVEDSA KRQYRVYKHT VGNTGADDLI YEEKDERYDV
YVWKSRSKGY IFLSSASHTT SEAKYIRADQ PNAEWKLIEP RKQDVKYTPD HNGKFFYLTV
NDTGVNYRLV KTPVDNPGKA HWQEVIPQRA ETMLNRAEFF KDFYVLHTRE KGLPLLRVVD
ITSGKSKDIT FPEPAYNTMS YMNADYDTKK FRYMYTSFIT PVSTYEYDVT TGESKLLKQR
EVPNYDGTKY KVEQLYAPAR DGEKVPVSVL YAKTTKLDGK EPLYLYAYGS YGASIDINFN
SNFFSLADRG VVVAIAHIRG GGEMGKAWHN AGRMMNKKNT FNDFIDSAEY LLKNNYGTKD
KLVIEGRSAG GLLMGAVLNM RPDLFHAAIV GVPFVDVINT MLDESLPLTV GEFEEWGNPK
EKAAFDYMYS YSPYDNIDAK PYPNMLVKTS FNDSQVMYWE PAKYVAKMRA LRKDDHLVIL
KTNLSPAGHG GSSGRYDRIH EFAFDYAFIL TQMGINE