Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1667 |
Symbol | |
ID | 4600046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1771554 |
End bp | 1772630 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639776266 |
Product | hypothetical protein |
Protein accession | YP_922867 |
Protein GI | 119715902 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3320] Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0262372 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGGT TCCCCGGCTT CCTCGGATCG GCCCTGCTCG GGCGCCTGCT GGCCCGGCGG GAGGGGGTCC GGGCGATCTG CCTGGTCCAG CCCCGGCACC TTGCGGAGGC GCGCACCAGG CTCGCCAGGA TCGAGGCCGA CCAGCCCCAC GTCGCCGGCC GCGTCGAGCT GGTCGAGGGC GACCTCACCG CGCCGGGCCT CGACGTCGCA CCCGCCGCGC ACGCCGGTCT GGCCGAGGTC ACCGAGGTGT GGCACCTGGC CGCCGTCTAC GACCTCACGG TGCCGGAGGC GGTCGCCCGA CGGGTCAACG TCGAGGGCAC CGAACGGATC CTCGAGTTCT GCCGCACCCG CGCGCGCCTC GACCGGCTGC AGTACGTCAG CACCTGCTAC GTCAGCGGCC GTCACCCCGG CCGCTTCTCC GAGGACGACC TCGACACCGG GCAGGAGTTC CGCAACCACT ACGAGTCCAC GAAGTTCGAG GCCGAGCTCC TCGTCCGCCG GGCGATGGCC GACGGTCTGC CGGCCACGGT CTACCGGCCC GGCATCGTCG TCGGTGACTC GCGCACCGGG GCGACGCAGA AGTACGACGG CCCCTACTTC CTCGCGACCT TCCTGCGGCG GCAGCCACGG TGGGCCGCGG TGGTGCCCGC GGTCGGGGAC GCCGACAGCG TGCGGTTCTG CCTGGTGCCG CGAGACTTCG TCATCGACGC CATGGACGCG CTCTCGGTGC TCGACCGCTC GGTGGGTCGC ACCTATGCGC TGACCGACCC GCACCCGCCG ACGGTCCGCG AGCTGGTGAC CACGTTCGCG CGCCACCTGG GACGCCGGGT CGTGTGGCTG CGGCTGCCGC TCGGCCCGAC CCGCGCCCTG GTGGGGCTCC CCGGAGTCGA GCGCCTGCTC GGCATCCCCG CCGAGTCGCT GGACTACTTC GCGTCGCCGA CGAGCTACGA CACCTCGCAC ACCTCCGCGA ACCTCGAGGG GACCGGTGTG GTGTGCCCGT CCTTCGACTC CTACGCGGGC CGGCTCCTCG ACTACATGGT CGAGCACCCG GAGATCAACG CCGCGGCGAT GGTCTGA
|
Protein sequence | MTGFPGFLGS ALLGRLLARR EGVRAICLVQ PRHLAEARTR LARIEADQPH VAGRVELVEG DLTAPGLDVA PAAHAGLAEV TEVWHLAAVY DLTVPEAVAR RVNVEGTERI LEFCRTRARL DRLQYVSTCY VSGRHPGRFS EDDLDTGQEF RNHYESTKFE AELLVRRAMA DGLPATVYRP GIVVGDSRTG ATQKYDGPYF LATFLRRQPR WAAVVPAVGD ADSVRFCLVP RDFVIDAMDA LSVLDRSVGR TYALTDPHPP TVRELVTTFA RHLGRRVVWL RLPLGPTRAL VGLPGVERLL GIPAESLDYF ASPTSYDTSH TSANLEGTGV VCPSFDSYAG RLLDYMVEHP EINAAAMV
|
| |