Gene Information |
Locus tag | Noca_0102 |
Symbol | |
ID | 4600075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 115531 |
End bp | 116544 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639774712 |
Product | thermostable dipeptidase |
Protein accession | YP_921334 |
Protein GI | 119714369 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
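The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable and function names are illustrative and not part of the record.

```python
# Values copied from the record above; names are illustrative only.
start_bp = 115531
end_bp = 116544
gene_length_bp = 1014
protein_length_aa = 337


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1014
print(expected_protein_length(gene_length_bp))  # 337
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.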
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGTGC CGACGCTGCC GGTGGCGGAC TGCCACAACG ACCTGCTGCT GGGTGTCCTG CACCAGCGCG AGCGGGGCAT CGCCGACCCG TTCGGCGACT TCTGGCTGCC GCAGCTGCGG GCCGGCGGCG TGGCCCTCCA GGTGCTCCCG GTCTTCACCG AGGAGCAGCA CGTGGGCGAG GGCGCGCTGC GCCGGAGCCT GCAGGTGCTG GAGGAGGCCC GCCGCCTGGC CGACGTGCAC GTGGCCGACG TGGCGATCTG CGAGCGGGGC GACCAGATCC GGCCCACCAT CGAGAGCGGC CGCATCGCGC TCGTCCTCGC ACTGGAGGGC TGTGAGCCGG TCGGCCACTC CCTCGAGCTG CTCGACACCT TCCACCGCCT CGGCGTCCGG ATCGCCTCGA TGACCTGGAA CCGCCGCACG ATGATGGCCG ACGGGGTCGG CGAGCAGGAT GCCGGCGGCC GCCTGACGAC CCTCGGCCTC GAGGCGGTCG CCGAGATGGA GCGGCTCGGC ATGCTCGTCG ACGTCAGCCA CCTCTCGGAG ACCGGCTTCT GGCACCTGGC CTCGGTCGCG ACCAGGCCCT TCGTCGCGAG CCACTCCTCC TGCCGGGCCC TGCAGCCGCA CCCGCGCAAC CTCACCGACG AGCAGATCCG GGCGGTGGCC GACAGCGGGG GCTTCGTCGC CATCAACGGG TTCGGCCCGT TCCTGTCCGA CGCCCCCACG GTGGACTCGT TCCTCGACCA CGTCCAGCAC GCGGTCGCGC TGGTGGGTCC CGAGCGCGTC GCGCTCGGCC TGGACTTCAT GCGCGACCTC GTGGACGCGG TCGACCCCGT GCTCTCCGGT GCGCTCGTCC ACCCCGACAC CCCGCCCTGG GTCGCCGGCC TCGAGCGGCC CGCCGACCTC GCCGCGCTCG CGGCGCGGCT GGAGGAGACG CTGGGGCCGC GAGCCGGCCG CCAGGTGGCT GCCGACACCG TCATCGACAC CCTGACGCGG CAGCTGGCTC CGGCGCCGCG CTGA
|
Protein sequence | MVVPTLPVAD CHNDLLLGVL HQRERGIADP FGDFWLPQLR AGGVALQVLP VFTEEQHVGE GALRRSLQVL EEARRLADVH VADVAICERG DQIRPTIESG RIALVLALEG CEPVGHSLEL LDTFHRLGVR IASMTWNRRT MMADGVGEQD AGGRLTTLGL EAVAEMERLG MLVDVSHLSE TGFWHLASVA TRPFVASHSS CRALQPHPRN LTDEQIRAVA DSGGFVAING FGPFLSDAPT VDSFLDHVQH AVALVGPERV ALGLDFMRDL VDAVDPVLSG ALVHPDTPPW VAGLERPADL AALAARLEET LGPRAGRQVA ADTVIDTLTR QLAPAPR
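The relationship between the gene and protein sequences can be spot-checked by translating a few blocks. The sketch below is an illustration under stated assumptions, not the database's own method: it uses the standard codon assignments (translation table 11 shares these with the standard code and differs in which start codons are allowed), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first three and last three 10-base blocks of the gene sequence are used; the last three begin on a codon boundary if the listed 1014 bp length holds.

```python
import itertools

# Standard codon assignments, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_blocks = "ATGGTGGTGC CGACGCTGCC GGTGGCGGAC"
last_blocks = "CAGCTGGCTC CGGCGCCGCG CTGA"

print(translate(first_blocks))              # MVVPTLPVAD
print(translate(last_blocks))               # QLAPAPR
print(round(gc_fraction(first_blocks), 2))  # 0.73
```

The two translated fragments match the first and last residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.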