Gene Acid345_0806 details


Gene Information

Locus tag: Acid345_0806
Symbol:
ID: 4068685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 998087
End bp: 999778
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 59%
IMG OID: 637982813
Product: peptidase M28
Protein accession: YP_589885
Protein GI: 94967837
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
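
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    length = gene_length_bp(998087, 999778)
    print(length, "bp")                      # 1692 bp
    print(protein_length_aa(length), "aa")   # 563 aa
```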


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.661029
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAATG CCGTGTTGTT GCTCATTGCG ATGAGCGCAA TGTCGTGCGC CATTGCACAA 
GGATCGCCGG AGAAGCCGAA ATCCGGAGCC AAGCAATCTG CCGCAAGCGT GCCCGCATCG
GCGCAGTCCG CGATCAACAA TGTCAACCCG GAGATCATCC GCGCGCACGT GCGTTTCCTC
TCACTGGATC TGCTTGAGGG TCGCGGAACA GGACAGCGTG GTGGCGACAT TGCCGCCGAA
TATATCGGCA CGCAGTTTGC TAGTTACGGA CTGAAGCCGA TAGGCGATAA CGGCAGCTAT
CTCCAGAAGG TCGAAATGCT TGGTATCAAG ACCAAGCCGG AGTCGAGCTA CGCGGTCACG
ACGGAGAAGG GACAGAAGCT GGATCTGAAG CCGGGACAGG ACATCGTTTC GATGGACGAA
ACCGGCAATG CCTCCGATGA GATCGACGCA CCTATCGTTT GGGTCGGGTT CGGCATCACT
GCGCCCGAGT ACAGGTGGGA CGATTACAGC GGCCTCGACG TGAAGGGCAA AGTGCTGCTG
ATGCTGGTGA ACGAGCCACC TTCTACGGAC CCGAACTTCT TCAAGGCCAA GGCACTCACG
TATTACGGAC GCTGGACATA CAAGTACGAG CAGGCCGCTC GCATGGGCGC AGTCGGCGTG
ATCCTCGTGC ACCAGACCGA AATGGCGAGC TACGGATGGG ACGTGGTCCA CAACTCGTGG
GGAGGCGAAC GTTCCTACCT GCGCGCGAAC AACGAGCCGC AGCTGAAGGC GGCGTCATGG
ATTCAGTACA ACGTGGCGAA ACAGATCTTT GCCGACTCCG ATATCGACAT CGCAAAGGCA
ATCAGTGAGG CGGGACGGCC GGGCTTCAAG GGACGCGAGT TGAACGCGCA TTTCAAGGCA
CACGTGGAGA GCGCGGTGCG TCCGTTCAAC TCCTACAACG TCGTCGCGGA GTTGCAGGGA
TCGGATTCGA AGCTGACGGA CCAGGCAATT ATGTATTCCG CGCACTACGA TCACCTTGGT
ATTGATCCGA ACCAGTCGGG TGACAACATC TACAACGGGG CGGCCGATAA TGCGACGGGT
TGCGGCATTC TGTTGGAGAT CGCCCGCGTG ATGTCAGCAT CGCCGGTGAA GCCGAAGCGA
TCCGTAATCT TCGCGTCGGT AACCGCGGAA GAACAGGGTC TGCGCGGATC GGAGTATCTC
GGGAAGAACC CGCCGATCCC CGCACGCAAT ATCAGTCTCG ACTTGAACTT CGACGATGTC
CCGCCGATCG GCATGCCGGA GGAAGTGCAG GTGAGTGGCG CGGAACGCAC GACGTTCTAC
CCAGTCGTGG AGCAGACCGC AAAGGAGTTC AAGCTCACGA TTGTTCCAGA TTCGCGGCCC
GAAGCTGGCC ACTACTATCG CTCCGATCAT TTCAGCCTGG CGCGCGTTGG AGTGCCGGCC
TTCTCGGTGA ATGAAGGCGA GAAGTTCGCC GGCCACGACA AAGACTGGGG TGAGAAGCAG
GCGCAGGACT ACAACAAGAA CCGTTATCAC CAGGTCACAG ACGAGTACAA GCCAGAAATG
GACTTCTCAG GCGACGCGCT GATGGCGAAG TTCGGTGTAG CGCTGGGCTG GAAGGCAGCA
AACCAGCCGA ACAAGGTGGG GTGGCGTGCG GGAGACGAGT TCGAAAAGGC CCGCAAAGCC
AGCGAACAGT AA
 
Protein sequence
MRNAVLLLIA MSAMSCAIAQ GSPEKPKSGA KQSAASVPAS AQSAINNVNP EIIRAHVRFL 
SLDLLEGRGT GQRGGDIAAE YIGTQFASYG LKPIGDNGSY LQKVEMLGIK TKPESSYAVT
TEKGQKLDLK PGQDIVSMDE TGNASDEIDA PIVWVGFGIT APEYRWDDYS GLDVKGKVLL
MLVNEPPSTD PNFFKAKALT YYGRWTYKYE QAARMGAVGV ILVHQTEMAS YGWDVVHNSW
GGERSYLRAN NEPQLKAASW IQYNVAKQIF ADSDIDIAKA ISEAGRPGFK GRELNAHFKA
HVESAVRPFN SYNVVAELQG SDSKLTDQAI MYSAHYDHLG IDPNQSGDNI YNGAADNATG
CGILLEIARV MSASPVKPKR SVIFASVTAE EQGLRGSEYL GKNPPIPARN ISLDLNFDDV
PPIGMPEEVQ VSGAERTTFY PVVEQTAKEF KLTIVPDSRP EAGHYYRSDH FSLARVGVPA
FSVNEGEKFA GHDKDWGEKQ AQDYNKNRYH QVTDEYKPEM DFSGDALMAK FGVALGWKAA
NQPNKVGWRA GDEFEKARKA SEQ
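
The sequences above are laid out in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be cleaned, how a GC fraction could be computed, and how the gene sequence could be translated. It is an illustration, not the method used by the source database. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation; the sketch does not handle table 11's alternative start codons, which is harmless here because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, copied from the listing above.
    print(translate("ATGCGAAATG CCGTGTTGTT GCTCATTGCG"))  # MRNAVLLLIA
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared against the listed protein sequence and GC content.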