Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4476 |
Symbol | |
ID | 4596995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4731861 |
End bp | 4732991 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639779087 |
Product | protein of unknown function DUF1100, hydrolase family protein |
Protein accession | YP_925660 |
Protein GI | 119718695 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCAGTACG GTCTGCAACG GTGCGCGCGG AGGGGGTGTC AGGTGGACGA CCTCGTCGAC TCCGCGGTGT CCCACTGGGG CCCGCGGTTC ACGGTCAACG GCGTGACGGC CGCCGACTTC GACCGGGTGA CCGCCGGGAT CGGGAGCTGG GACGACTGGT GCGCCGCCTG GGTGGCCGCC GGCGCGGCGC ACGAGGCGCT CGGGCGAGAT GCCCTCGCCG AGGGCCGGAC CCGCTCGGCC GGCGAGCACC TCGCCCAGGC CGCCGTCTAC CACCACTTCG CGAAGTTCGT CTTCGTCTGC GACCTCGACC AGATGCGCGC GGCCCACGCC CGCGCGGTCG CCTGCCTCGA CGCCGCCCTG CCGCACCTCG ACCCGCCGGG CCGTCGCCTC GAGATCCCGT TCGAGGGCAG CCGGCTGGCC GGCGTGCTCC GGCTGCCCAC CGGGCCGGGC CCGCACCCGG TCGTGGTGCT GCTCTCCGGC CTGGACTCGG CCAAGGAGGA GCTCCGCTCG ACCGAGCAGA CCTTCTTGGA CCGCGGGCTG GCGACGCTGA GCGTCGACGG CCCCGGCCAG GGCGAGGCGG AGTACGACCT GCCGATCCGC GGCGACTGGG CGCCGGTGGC CGAGGCCCTC TGGGCGGCGC TCGGCACCCT GCCGGAGGTG GACCGCGACC GGCTCGGCCT CTGGGGCGTC AGCCTCGGCG GCTACTACGC ACCCCGCGTG GCGGCCGCGC TCGGCGAGCG GGCCCGTGCC TGCGTGGCGC TGGCCGGGCC GTTCAACTTC GGCGGGTGCT GGGACCGGCT GCCGCAGCTG ACCCGGGACA CGTTCCGGAT CCGCTCCGGC GCCGCCAGCG AGGAGGAGGC CCGGAGGATC GCGCTGACCC TCGACCTCGA GGAGACCGCC CGCGACGTGG TCGCCCCGCT GCTGATCGTC TTCGGACGCC AGGACCGGCT GATCCCCTGG GAGCACGCCG AGCGCCTGCG CGACGCCGTC TCGGGACCGG TCGAGCTGCT GATGCTCGAG GACGGCAACC ACGGCTGCGC CAACGTCGCG CCCTGGCACC GGCCCCGCAC CGCCGACTGG CTGGCCACCC GGCTCGCCGT ACCCGCACTA CTGACGAAGG CAGGATCATG A
|
Protein sequence | MQYGLQRCAR RGCQVDDLVD SAVSHWGPRF TVNGVTAADF DRVTAGIGSW DDWCAAWVAA GAAHEALGRD ALAEGRTRSA GEHLAQAAVY HHFAKFVFVC DLDQMRAAHA RAVACLDAAL PHLDPPGRRL EIPFEGSRLA GVLRLPTGPG PHPVVVLLSG LDSAKEELRS TEQTFLDRGL ATLSVDGPGQ GEAEYDLPIR GDWAPVAEAL WAALGTLPEV DRDRLGLWGV SLGGYYAPRV AAALGERARA CVALAGPFNF GGCWDRLPQL TRDTFRIRSG AASEEEARRI ALTLDLEETA RDVVAPLLIV FGRQDRLIPW EHAERLRDAV SGPVELLMLE DGNHGCANVA PWHRPRTADW LATRLAVPAL LTKAGS
|
| |