Gene Acid345_1930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1930 
Symbol 
ID4071041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2320039 
End bp2321979 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content57% 
IMG OID637983942 
Productpeptidase S9, prolyl oligopeptidase 
Protein accessionYP_591005 
Protein GI94968957 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCTCAC TACGTCGCAT TGCAATACTC GCCCTGTGCA CTCTGTTTGT TTTGCCTGCG 
TTCGGGGAAG AGAAGTTCAC GACCGAGTTG ATTCCGCGCT CGGTGTTGTT CGGCAACCCA
GAACGGGCCG ATCCGCAAAT CTCGCCGGAC GGGAAACAGC TTGCGTATCT CGCGCCGGTG
AATGGCGTAC TGAATGTGTG GGTGCGTACG CTCGGCAAGA CTGACGATCG TGCCGTGACC
AGCGACACCA ATCGCGGCAT CCGAAATTTC ATTTGGCAAT ACGATGACCA ACACCTTCTC
TATCTTCAGG ACGCGGGAGG CGACGAGAAC TGGCGCCTCT ATCAAACTGA TATCGCCACC
AAGCAGACGA AAGATCTCAC TCCGTTCGAC AAAGTTCGCG TGGATATCGT CGCCTATTCC
TGGAAAACGC CCGACGCGAT CCTCGTCCAG ATGAACCAAC GCGACCCGAA GGTCTTCGAC
GTGCACCGCG TTGATCTTAA GACCGGCAAG GTGACGCTCG ACACGCAGAA CCCTGGCGAT
GTGGCGAGCT GGCAGGCCGA CAACTCTCTC GAAGTTCGCG CTGCGCAGGT ATCAACCGAC
GACGGCGGGA CCATCATCCG CGTGCGTAAC GACGTCAAGT CGCCATGGCG CGACCTGATG
AAATGGGGAC CCGATGAGAC CTTCGGCGGC GTCGGTGGGT TCACGCCCGA CAATCGTTCG
CTGTGGGTGA TGACCAGCCT CGATGCAAAC GCCGCGCGTT TGCTGGAGAT CGACATCGCC
AGTGGCAAGC AGAAGGTGGT TTCGGAAGAT CCGCAGTTCG ACGTTCGGGC GACGATCAAC
AATCCTAAGA CGAATGCACT CGAAGCCATC AGCTATGCCA AAGACAAAAC CGATTACATC
TTTGTAGATC CCAAGGTAAA GAGCGACTTC GCCGTGCTTT CCAAGGTGCA TGATGGCGAG
ATCGACAGCC TTTCGCAAAG CCTCGACGAC AATCGCTGGA TCGTCGGCTA CATCAGCGAC
GACGCTCCTG AGTACTGGTA CCTCTACGAT CGCCCGACGC AGAAAGCTAC GCTGCTCTTC
AGCAACCGTC CGCAACTGGA GAAGTACAAG CTCTCGAAGA TGCAGCCGAT CGAGTACACC
GCGCGCGACG GCATGAAACT CTATGGCTAC TTGAGCACGC CGGCCGGCAT GGAAGCAAAG
AACCTGCCGA TGGTCGTCTT CGTTCACGGT GGCCCGTGGG GCCGCGACGA GTGGGGGTAC
AACCGCTACG CGCAGTGGCT CGCCAATCGT GGATATGCAG TGCTACAAGT GAATTTCCGC
GGTTCAACCG GCTATGGCAA GAAGTATGTC AATGCTGGAG ATCGTCAATG GGCAGGATCG
ATGCATACCG ATCTCCTCGA CGGCAAAGAC TGGGTTGTGA AGCAGGGCAT CGCCGATCCT
GCGAAAGTTT GCATTATGGG CGGCAGCTAC GGCGGCTATG CAACTCTGGC CGGCGTGACC
TTTGCGCCCG ATGCATTCGC CTGCGGCGTG GACATCGTTG GTCCGTCGAA CCTGAACACG
CTATTGAAGA CGATTCCGCC TTATTGGTCC ACGATATTGT CCACCTTCCA CAAACGCATG
GGAGATTCGG AAGCGGTGCT CACTTCGCAG TCGCCGCTCT TCAAAGCCGA CCAGATCAAG
GTGCCGCTGC TGATCGGGCA GGGCAAAAAT GATCCCCGCG TAAACGTGGC GGAGAGCAAT
CAGATCGTGG CCGCTATGCG CAAGAACAAT AAGCCGGTGG AATACTACAT CTTCCCCGAC
GAAGGCCACG GCTTCGCCAA ACCAACCAAC AACATGGCCT TCAATGCGGC GTCAGAGGAG
TTCCTCGCCA AATACCTCGG AGGCCGTGCC GAGCCGCCAA GCGAGGCAGA AAGCAAGCTG
CTGGCCAGCA TCAAGCAGTA A
 
Protein sequence
MFSLRRIAIL ALCTLFVLPA FGEEKFTTEL IPRSVLFGNP ERADPQISPD GKQLAYLAPV 
NGVLNVWVRT LGKTDDRAVT SDTNRGIRNF IWQYDDQHLL YLQDAGGDEN WRLYQTDIAT
KQTKDLTPFD KVRVDIVAYS WKTPDAILVQ MNQRDPKVFD VHRVDLKTGK VTLDTQNPGD
VASWQADNSL EVRAAQVSTD DGGTIIRVRN DVKSPWRDLM KWGPDETFGG VGGFTPDNRS
LWVMTSLDAN AARLLEIDIA SGKQKVVSED PQFDVRATIN NPKTNALEAI SYAKDKTDYI
FVDPKVKSDF AVLSKVHDGE IDSLSQSLDD NRWIVGYISD DAPEYWYLYD RPTQKATLLF
SNRPQLEKYK LSKMQPIEYT ARDGMKLYGY LSTPAGMEAK NLPMVVFVHG GPWGRDEWGY
NRYAQWLANR GYAVLQVNFR GSTGYGKKYV NAGDRQWAGS MHTDLLDGKD WVVKQGIADP
AKVCIMGGSY GGYATLAGVT FAPDAFACGV DIVGPSNLNT LLKTIPPYWS TILSTFHKRM
GDSEAVLTSQ SPLFKADQIK VPLLIGQGKN DPRVNVAESN QIVAAMRKNN KPVEYYIFPD
EGHGFAKPTN NMAFNAASEE FLAKYLGGRA EPPSEAESKL LASIKQ