Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4228 |
Symbol | |
ID | 4596742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4469118 |
End bp | 4471079 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 639778834 |
Product | hypothetical protein |
Protein accession | YP_925412 |
Protein GI | 119718447 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCCGC CAGCGGTCCG GGACGCTCGT TCATGCACTC TGTGGGGAGG TCAGTCCATG GCCGTGGGCC GCCGCGCGCT CGCCGTCATC GTCGCCATCG CCGTCGCGCT GGTCACCGGG TCCACCATCG CCCTGGCCAG CACCGGGCAG GCCGCGGGCG AGCCCGCGCC GGCCCGGCTG CATCCGGTGC TGGTCAGCGG GTCCGGGGTC GGCGCCCTGA CCTGGCACGG GCCGGGGGCC ATGCTCGGGG AGTACGCCGC GAGCATCACC GCCGGCGCGA TCGGCATCGC GGACCAGCCG CGGCTCGCCA CGAGCGGCCA GGCCGACACG GTGCGCATCG GGAAGTACGC CACCAGCGGG CCCGCGGCGA TCAGCCTGGC GCTGATCCGC GGCAACGCCG TCCAGCTGAT CGGTGGCGTC GACCTGCCGG CGGCGGCCGG GGTGTACCAG TACGGCTTCG TCGACGGCGC CGGCGCGCCG ACGACGCTGC CGTTCCAGGC CGGCGACCGG CTGGGCATCG TCAACGAGTC GAGCGCTCCG CTCCCCGTGA TCGCCAAGGC GAGCACGGCG TACAGCGAGG AGGAGTGCAA CCTCGACGCC GCGAACGCGC TGACCGGCTG CACCATCGCG ACCGTCGGGT GGCGGCTGCA GGTTGACCTG GGCGTCGGCG AGGCGGCGCC GAGCGGCCCG AGCACGTCCG CCTCCCCCAC GGGCCCGGCG AGCCCGACCG GCTCCGCGAC GCCGACCGGC TCCGCGTCAC CCACCGGCTC GGCGTCGCCC ACCGGCTCCG CGTCCCCGAC GGGCTCGGCG TCCCCGACCA CCCCGACCGG TTCCGCGACC CCGAGCGACC CGAGCGACCC GGGCGGCGTC GACCCCGGGA CCCCGGGGGC GCCGGTGACC AACTGCCCGA CCGGGATCGC GGCCGACGGC TGGCGGATCC CGAAGAGCAC TCGGGCGCCG CGCTCGCTGG TCGGGTTCGA CCTGGCCAAG GACGGCCGCG ACAAGCTGCG GGGCACGCTG GTCACCTCGC TCACCGACAA CCTGGTCGAG CGGCTGCCGA GCGGCCGCGG GGCGTACTAC AGCAGCTCGA CCCGCGCGTC GGCGTACTCC CTGGCCGGCG CGACCCAGGC CGTCGCCTGG TCCGAGCATC ACGACGTCAC GCTCGACAGC GGGCTGCTGC TGCCGTGCGA GGCCGTCTAC GCGACGCTGC GGGCGGGCGG GCCCGCACAG CCGCTGCTGT GGGCGAAGTC CGGCGGCCTC GACCGCCTCG ACGCCGCCGT CGACGAGGCC GGGCGGCTGG CGCTGGTCGT CACCAACGCG GCGCCGCTGG ACAAGGACGT GCGTGCGCTG TACTGGCCGG CGGTGCGCAA GGACGGGGTG TACGACGTGC GGGCCGGCAC CCGGCTGCCG CTGCGCGGCG TCGAGACGGT GCGGGTCGCG ACCGCCGGTG GGCGGGTCGT GATCGCGGTG GCCGGGTTCG GCCGGATGGC GGTGTACACC GGGCCGATCA CCGGCCAGCT CCGGGAGGTG TTCCGGGCCC GCAAGGTCGA CCCGTCGCTC GACGTGGCGG TGGGGCGCGG CGGCACGCCC GTGGTGGCCT ACGCCAGCGA CGGCCGGCTG CACCTGCTGG TCGGCGGGCG CGACGTGGAC ACCCGCTACC CGGCGTACGG CGCGGCCATG GCGCTGTCGA AGTCCGGCAC CGTCGCGCAC CTCGCCTGGT CGGTCAAGCC GACCGCGGAG CCGTGCCCGA CCAGGTCCGG GCTGCTGTAC AACTGCGCCG GCCGCAACGC GCTGCTGGTC GCCGACGCCG CACTCTCGAA CGGGCGGCTG CGGCACGTCG GCATCGGCGC CTGGCTCGGC GACGGCGTCG CCCCGCAGGT CGTGGTCGGG CCGGGCGACC GCGCGGCGGC GTACTACATC CCGCCGACCA GTCGTCGGCT CGTGGTGCGG CCGGTGCCGT GA
|
Protein sequence | MMPPAVRDAR SCTLWGGQSM AVGRRALAVI VAIAVALVTG STIALASTGQ AAGEPAPARL HPVLVSGSGV GALTWHGPGA MLGEYAASIT AGAIGIADQP RLATSGQADT VRIGKYATSG PAAISLALIR GNAVQLIGGV DLPAAAGVYQ YGFVDGAGAP TTLPFQAGDR LGIVNESSAP LPVIAKASTA YSEEECNLDA ANALTGCTIA TVGWRLQVDL GVGEAAPSGP STSASPTGPA SPTGSATPTG SASPTGSASP TGSASPTGSA SPTTPTGSAT PSDPSDPGGV DPGTPGAPVT NCPTGIAADG WRIPKSTRAP RSLVGFDLAK DGRDKLRGTL VTSLTDNLVE RLPSGRGAYY SSSTRASAYS LAGATQAVAW SEHHDVTLDS GLLLPCEAVY ATLRAGGPAQ PLLWAKSGGL DRLDAAVDEA GRLALVVTNA APLDKDVRAL YWPAVRKDGV YDVRAGTRLP LRGVETVRVA TAGGRVVIAV AGFGRMAVYT GPITGQLREV FRARKVDPSL DVAVGRGGTP VVAYASDGRL HLLVGGRDVD TRYPAYGAAM ALSKSGTVAH LAWSVKPTAE PCPTRSGLLY NCAGRNALLV ADAALSNGRL RHVGIGAWLG DGVAPQVVVG PGDRAAAYYI PPTSRRLVVR PVP
|
| |