Gene Acid345_0410 details

Gene Information

Locus tag: Acid345_0410
Symbol:
ID: 4068728
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 471430
End bp: 473568
Gene Length: 2139 bp
Protein Length: 712 aa
Translation table: 11
GC content: 59%
IMG OID: 637982413
Product: peptidase S9, prolyl oligopeptidase
Protein accession: YP_589489
Protein GI: 94967441
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
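
The start and end coordinates, the gene length, and the protein length listed above agree with one another. A minimal sketch of that arithmetic follows, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(471430, 473568)
print(length_bp)                  # 2139
print(protein_length(length_bp))  # 712
```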


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0851483
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAAAC TTTCTAACGT TTTAATTCTG CTGTTTCTCA GCATCGGGTC CTCACTCTTC 
GCGCAGCAGA AACCCAAGCT CACGCTCGAT GAGTTCTTCA ACGCCGTTTA CTACCGCGAC
GTCCATATCG CTCCCGACGG CAATGCTGTA GTCTTCGCCA CCGAGCGCGC CGATTGGGAC
AATGACCGCT TCCGCGAAGA TCTCTGGCTC TACCGCACAA ATGGCGGTTC TCTCATCCCG
CTCACGCGGA CCGGTCACGA CAGCAGCCCC CAATGGTCGC CTGACGGTCA GTGGATCGCC
TTCCTCAGCG ATCGCGCCAC CGACAGCGAC AGCACCAAAG ACGACGATGA CGACTCCAAG
GACAAATCGA AGGGCGTCTC GCACGTCTAT GTGATTTCTG CCACCGGCGG CGAAGCCTTC
GCCGTCACGC GCGGAGAGGA AGAAATCCAC GCCTTCGCCT GGGCACCCGA CTCGAAGGGC
ATCTACTTCG CCACTCGCAC CCCGTGGAGC AAAGAGAAAA AGGACGCATA TAAAGAAGCT
TGGAAGGACA CCGTCCAGTA CCGCGCTTCT GAACGTGGCG ACGTAATCGC CCGTATCGCG
CTCGCCGATG TCATGGCACG CCACTTCTCT CTGACCTCGG AACCCAAAGA GAAAAAGAAG
AAAGACTCCG ACAAGGAAGA CAAGCCCGAG ACTGCCGAAA CTCCCGGATC GGCCCTGATC
ACCAGCACTC CCTTCAGGGT TCGCCAGCTT GCTGTCTCGT CGGACAACGC TCGTCTCGCC
TTCCTCACCG GCTCCATCTC GGAGCGAGAA GAGAGCGCTG CCGACTTCGA GATCTACGCC
GCCGAAATCG CCGGCACCGT TCCTGAACAA GCCAAGCGGC TCACCAATAA CAGCGGCATC
GAGTTCGAAA TCCGCTGGGC CGCCGACAAT CACCATCTCT TCTTCCAGAA TTCCGAAGGC
TCCGTTGAAT CCACGAAATA CGCCGACACC CAGAATCGCA TCTACTGGAT CGACGCCACC
ACCGCTCAGA TCGAGCGCTG GGCCGCAACC TACAAAGGCC ACGTCTCTGA ATATGCTCCG
CTGAAGCCTG ACACGCTCCT CACCGCGGGA ATGTCTGGCA CCGAGACCCA GCTCTACACC
GCAACGGGGG CAAAGGGAGA AGTAAAGAAG ATCAGCTCCT GGCCCGGCAC GTACGCGAAC
GTTTCGGCAA TCGCAAGTTC CCCACGCGTG GCTTTCATCT ACTCTCGCGT AAATAAGCCG
ACGGAAATCT ATCTCGCCGA CAGCCTCGAC CAGCTCGCCG ACGCCAAGCC CATCACCAGC
TTCAACAAGC TCTTCACCGA ACGCGCTCTG CCGGAAGCCA AGCCCTTCCA GTGGAAAGCC
GATGACGGCG TTACCGTCGA AGGCATGCTC ATCTATCCTC CCGGCAAGTT CGGTGAGAAG
AACCTGCGCA TGTTCACCTT TATCCACGGT GGTCCGATTG ACGCCGACGG CGACCACTTC
GGTGCCGACT GGTACGACTG GGCACTCCTC GCCGCCTCCG AGGGCTGGCT CGTCTTCCGA
CCAAATTACC GCGGGTCCAC TGGCTACGGG GATGAATTCG AACAGCAGAT CGCCCCGCAT
CTCGTCTCCA AGCCCGGCAA AGACATCCTT GAAGGCGTAG ACGCTCTCGT TAAAGCTGGC
ATCGCCGATC CCAACCAGCT CGCCATCGGC GGCTACAGCT ACGGCGGCTA CATGACCAAC
TGGCTCATCA CCCAGACCAC GCGCTTTAAG GCAGCCGTCA CCGGCGCCGG CGCCGTCGAA
CACGCCGCCA ACTGGGGCAA CGACGACACT ACCCTCGATG ACTCCTGGTA CCTCGGCGGC
GCTCCCTGGG AAGCCACCAA GATGTACACC GACGAAGCCG CGCTCTATCA GGCCAACAAA
ATCAAAACTC CCACCCACAT GGTTGCCGGC GGTGACGACA TCCGCGTCGC CGTGCTCGAA
GACTATCTTC TCGAACACGC TCTTAAGACC CTCGGCATCC CCAATGCGCT CCTGATTTTC
CCCGGCGAAG GCCATGGTCT TGGCAAGAAC CCATGGCATG GGAAAATCAA AGTCCGCGAA
GAGATAAAGT GGTTGGAGAA ATACGCACCC GCTAAATGA
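
The gene sequence is printed in blocks of 10 bases, 60 per row. Before any analysis the blocks have to be joined into one string. The sketch below shows one way to do that and to compute a GC percentage for comparison with the GC content field. The helper names are hypothetical, and the record does not state how its GC figure was rounded.

```python
def clean_sequence(text):
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCGTAAAC TTTCTAACGT TTTAATTCTG CTGTTTCTCA GCATCGGGTC CTCACTCTTC"
print(len(clean_sequence(first_row)))  # 60 bases in the first row
```

Passing the full sequence text to `gc_percent` gives the value to compare against the reported GC content.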
 
Protein sequence
MRKLSNVLIL LFLSIGSSLF AQQKPKLTLD EFFNAVYYRD VHIAPDGNAV VFATERADWD 
NDRFREDLWL YRTNGGSLIP LTRTGHDSSP QWSPDGQWIA FLSDRATDSD STKDDDDDSK
DKSKGVSHVY VISATGGEAF AVTRGEEEIH AFAWAPDSKG IYFATRTPWS KEKKDAYKEA
WKDTVQYRAS ERGDVIARIA LADVMARHFS LTSEPKEKKK KDSDKEDKPE TAETPGSALI
TSTPFRVRQL AVSSDNARLA FLTGSISERE ESAADFEIYA AEIAGTVPEQ AKRLTNNSGI
EFEIRWAADN HHLFFQNSEG SVESTKYADT QNRIYWIDAT TAQIERWAAT YKGHVSEYAP
LKPDTLLTAG MSGTETQLYT ATGAKGEVKK ISSWPGTYAN VSAIASSPRV AFIYSRVNKP
TEIYLADSLD QLADAKPITS FNKLFTERAL PEAKPFQWKA DDGVTVEGML IYPPGKFGEK
NLRMFTFIHG GPIDADGDHF GADWYDWALL AASEGWLVFR PNYRGSTGYG DEFEQQIAPH
LVSKPGKDIL EGVDALVKAG IADPNQLAIG GYSYGGYMTN WLITQTTRFK AAVTGAGAVE
HAANWGNDDT TLDDSWYLGG APWEATKMYT DEAALYQANK IKTPTHMVAG GDDIRVAVLE
DYLLEHALKT LGIPNALLIF PGEGHGLGKN PWHGKIKVRE EIKWLEKYAP AK
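
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation follows. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons), and it does not handle alternative start codons or ambiguous bases. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid
# assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string from its first base until the first stop codon.

    Whitespace between blocks is ignored. Codons containing anything
    other than A, C, G, or T raise KeyError in this sketch.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
print(translate("ATGCGTAAAC TTTCTAACGT TTTAATTCTG"))  # MRKLSNVLIL
```

The output matches the first block of the protein sequence, and translating the final nine bases (`GCTAAATGA`) yields `AK`, the last two residues, before the stop codon.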