Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccur_00230 |
Symbol | |
ID | 8374231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cryptobacterium curtum DSM 15641 |
Kingdom | Bacteria |
Replicon accession | NC_013170 |
Strand | + |
Start bp | 30745 |
End bp | 32997 |
Gene Length | 2253 bp |
Protein Length | 750 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644992947 |
Product | dipeptidase |
Protein accession | YP_003150438 |
Protein GI | 256826479 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4690] Dipeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 84 |
Fosmid unclonability p-value | 0.00762022 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGTAAGT TTGCAAAGAT AGGTTGCGCA GTTGCGCTTT CGGCGTCACT TGCGGTGGCG CTGCCAGGAA GCGCATTTGC CTGTACGGGG ATCTACATTG GATCTCAGTA CGACACGGAC AGCTCGTCGT ACTTCTCGCG TTGCGAGGAC TACACCTATG TGCCGCCGTC GGCTGTGCAC CTGAAAGTGT TCGGCGTACA GGCTGCCACA AAGAACTCAG GGCAGACCTA CAACAATGCC GAAGAAATGG GCGGAGTCGA CCAGACCCAC TTTAGCCGTC CGTATCCGGC AAGCACGTAC CGGTTCAGCT ACATTCGTGA CTCAAGTGAC TATGGCGCTG GCGATATGGC GTACGCCGAG GCAGGTACTA ATGAACTGGG TGTCTCGATG AGTGCGACAG TCACGACCGA TTATAACGAC GCTGCCAAGG CGGCCGATCC TCTCACCGAC AGCGGAGTAA CCGAAAATAA TATTGGGACG ATTCTACTGG GCGAGGCCAC GTCTGCTAAG CATGCTATGC AGATTGCAGG CGACGTTATG GACCAATATG GCGCGGGTGA GAATTTCCGT ATTTTTGCAA GCGATTCAAC GGGTGAGACC TGGGTATTCA ATGCGCTTTC GGGCCATCAG TGGGTTGCTT TTAAGCTGCC AAACGACAAG TTTTCCGTTG ACCCCAACAT GGGGCGCCTT CAGTACAAAA TTAACCTAGA CAGCTCCGAT GTGCTTCATT CCGAAAAGGT AAAGCAGCTT GCCGAAGACA ATGGCTTCGC CAAGACGTTT GCCGATGGAA GCTTTGACGT TTCAACTTCG TATGGCAAGG CAAATTCAGG AGCGGGGCAG TATTCGCGCT TCTACATGGG CGTCAATCAG CTTGACCCGG CGGTTGCTGG ATCGCTTCAG ATTACCAAGG ATGCCAATGG GAAGTTAACG GACATTCAAG ATCCGCAGTA CTTGTACAGC TCTTCAACGT TGAAGATGAC CGGTCCGTCC AAGCTGATGA ACGCTCTGCG CACGCGGGGT CAGGGGTCTG AATTTGATGC GACCACCAAT AGCAATCTGT ATCCGTTGAG CAATCCCTAT CAGTTGGAGT GCCATATCTT CCAGACGCGT GCTGGCGACT TGCCGGTGGG CATGAAGACC ATTCAGTGGC AGGCAACGGG ACGTAACGGA TATAACGTAT TTCTGCCGTC ATATTCCGCG CTGCTTTTAC CTGATGGCGA TGATGGTTTC AAGGGAACGG TTCAGCATGA TATCGGTGCG CTCGCACAAG ATACCGGCAA CCAGACAACG CCTTCCATCA ACAAGCTCAA CAGTCTTGAC GGCGTTGAAA ATTCCCAGTA TTACACGATG CTTGAGCTGA ATAATCTCGT TGATACCAAG CCGGATCTCT ATGCGAAGAA TGTATCCAAG TATCTGCAGA ACCTGCAGAA CAAACTTATC GACGAGCAGT CTGCCGTTGA CCAGAAAATG CTTGCACTGC CGGCTGATGC GCGTGAAGCA AAAGCAACCG ATGTTGCTGA TAAGCTTCAG ACACAGCTCA CGCAAAAGAC CATCAAGCTT ATTAAGGAAA TGAAGGCATA CGAGCAGGCG GGGGATACCA GCAAGCCGTT TATGCCATCC GAACTCAATG CTGATGGTAC CAATAATGCG TATCTGGATT ACCTGAGTCT CTTTGAAGAA CCAGCTCCTA ACCCCGAACC AACGCCTGGT TGGGCGCAGG ATGATAACGG TCAGTGGTCC TATACCAATG CTGATGGTAG TAAGGCTGTT GCCTGGCAAG AAGTAAATGG TACCTGGTAC TACTTCAACG ATGAAGGCAT CATGCAGACC GGCTGGGTGC AGCTTGATGG TGCGTGGTAC TACCTGCAGT CTTGGGGCGG CATGGCTCTC GACTGGCAAT TAGTCGACGA CACCTGGTAT CACTTTGATT CAGCGGGTGC CATGCAGACC GGCTGGTTGC AGCTTAGTGG CGCCTGGTAT TACCTTACTG ATTCGGGTGC TATGGCCACC GGTTGGATTC AGGTTGACGG CACGTGGTAT TACTTTGCCA ATGACGGTGC TATGCAAACC GGCTGGGTCT ATGTCGATAA CGACTGGTAT TACCTGAACG CTGATGGTTC CATGGCAACT GGCTGGCTAC AGCTCGATAG CACGTGGTAT TACTTCAAGA CGTGGGGCGG CATGGCTGCA GGAAACTATC CTGTTGATAA TGTCATGCAG CGCTTTGCTT CTTCTGGAGC GTGGATTGGC TAA
|
Protein sequence | MGKFAKIGCA VALSASLAVA LPGSAFACTG IYIGSQYDTD SSSYFSRCED YTYVPPSAVH LKVFGVQAAT KNSGQTYNNA EEMGGVDQTH FSRPYPASTY RFSYIRDSSD YGAGDMAYAE AGTNELGVSM SATVTTDYND AAKAADPLTD SGVTENNIGT ILLGEATSAK HAMQIAGDVM DQYGAGENFR IFASDSTGET WVFNALSGHQ WVAFKLPNDK FSVDPNMGRL QYKINLDSSD VLHSEKVKQL AEDNGFAKTF ADGSFDVSTS YGKANSGAGQ YSRFYMGVNQ LDPAVAGSLQ ITKDANGKLT DIQDPQYLYS SSTLKMTGPS KLMNALRTRG QGSEFDATTN SNLYPLSNPY QLECHIFQTR AGDLPVGMKT IQWQATGRNG YNVFLPSYSA LLLPDGDDGF KGTVQHDIGA LAQDTGNQTT PSINKLNSLD GVENSQYYTM LELNNLVDTK PDLYAKNVSK YLQNLQNKLI DEQSAVDQKM LALPADAREA KATDVADKLQ TQLTQKTIKL IKEMKAYEQA GDTSKPFMPS ELNADGTNNA YLDYLSLFEE PAPNPEPTPG WAQDDNGQWS YTNADGSKAV AWQEVNGTWY YFNDEGIMQT GWVQLDGAWY YLQSWGGMAL DWQLVDDTWY HFDSAGAMQT GWLQLSGAWY YLTDSGAMAT GWIQVDGTWY YFANDGAMQT GWVYVDNDWY YLNADGSMAT GWLQLDSTWY YFKTWGGMAA GNYPVDNVMQ RFASSGAWIG
|
| |