Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0390 |
Symbol | |
ID | 4597776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 422097 |
End bp | 423287 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639775004 |
Product | peptidase U34, dipeptidase |
Protein accession | YP_921620 |
Protein GI | 119714655 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4690] Dipeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.17087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTGCGACA CCATCGTCTT CCGGACCGAC TCGGGCATGG TGCTGGCGAA GAACTCCGAC CGCGACCCCA ACGAGGCCCA GCTGTGGGAA TGGACCCCCG CAGCGACGCA CGAGGCCGGT GCCCGGTCCC GCACGACGTA CGTCGACCTC CCCCAGGTCG CCTGCACCCA CGCGACTGTC GTCTCGCGGC CGTGGTGGAT GTGGGGAGCC GAGATGGGCG CCAACGAGCA CGGCGTGGCG ATCGGCAACG AGGCGGTGTT CACCAAGCAG CGGACCAGCC TGGAGCCCGG GCTGCTCGGC ATGGACCTGC TGCGGCTGGC GCTCGAGCGC GCCGCGTCGG CCCGCGAGGC CGTCGAGGTG ATCGTCGCGC TGCTCGAGGA GCACGGCCAG GGCGGCGCGT GCAGCGCGGA GCACCGCCGG TTCACCTATC ACAACAGCTT CCTGGTCGCG GACCGGGACG GCGCGATCGT CCTCGAGACC GCCGGCCGTC ACTGGGCCAG CGAGGACGTC ACCGGCGCCC GCAGCATCAG CAACGGGCTC ACGATCGCCG GGTTCGCGGA GCGGTACGCC GACCGGCTGC GCGGCCGGGT CGCCGGCTGT GCAGGGCGCC GGTCGCTCAC CGAGCGGCGG GCGGGCGGCG CGGAGGGCGT CCTCGACGCG ATCTCGATCC TGCGCGACAA CGGCACCGAC GGCGGCCCGC GATGGTCGCT GCTCAACGGG TCCATGGTCG GCCCGAACAT GCACGCGGGC GGCCTGCTCG CGTCCAGCCA GACCGTGTCC TCATGGGTCA GCGACCTCGG GTCCGGCCTG CACTGGGCGA CCGGGACGGC CGACCCGGCG CTCTCGCTGT TCGTGCCGTT GCGCGTCGAC CAGCCGCTCG CGGAGACGGC GTACCCGACC GCGGGCGTCG ACAACCGGCG GGACGACCGG TCGCTGTGGT GGCGCCACGA GCGGCTGCAC CGGACCGCGC TGTGCGACTG GACCGGCGTG GAGGCCCGGC TCGCCGCCGA GCGCGACGAG ACGCAGCGGC GTTGGGTCGC GCACCCCGTC CCGACCGCGA CCGCGCTCGC CGAGGCCGAG GTGCTGCGGG AGAAGTGGAC CGCTGCGGCG GAGGCGGGGA CCGGTGACAC CCGGCCGGCG TGGGTGCGCC GGCGGTGGGC GGGCTTCGAG CTGAAGGCGG TGGGTCGGTG A
|
Protein sequence | MCDTIVFRTD SGMVLAKNSD RDPNEAQLWE WTPAATHEAG ARSRTTYVDL PQVACTHATV VSRPWWMWGA EMGANEHGVA IGNEAVFTKQ RTSLEPGLLG MDLLRLALER AASAREAVEV IVALLEEHGQ GGACSAEHRR FTYHNSFLVA DRDGAIVLET AGRHWASEDV TGARSISNGL TIAGFAERYA DRLRGRVAGC AGRRSLTERR AGGAEGVLDA ISILRDNGTD GGPRWSLLNG SMVGPNMHAG GLLASSQTVS SWVSDLGSGL HWATGTADPA LSLFVPLRVD QPLAETAYPT AGVDNRRDDR SLWWRHERLH RTALCDWTGV EARLAAERDE TQRRWVAHPV PTATALAEAE VLREKWTAAA EAGTGDTRPA WVRRRWAGFE LKAVGR
|
| |