Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3972 |
Symbol | |
ID | 4598107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4189469 |
End bp | 4192450 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639778577 |
Product | PKD domain-containing protein |
Protein accession | YP_925156 |
Protein GI | 119718191 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCCGCGC CCGGTGAGAC GGTCACGTTC ACGGTCTCCA GCGACGCGAG CGGGAACCCG ACGTTCGTGT GGAACGTCGA CGGGCTCGAC GTCCAGTCCG ACAGCTCGTC GCTCCAGTGG GCGTTCCAGG GCGAGGGTCA GCACACGGTC ACGGTGACCG TCGACGACGG GTTCGACTCC GGCACCGCGT CGACGTCGAT CGAGGTCCGC ACGCCCCAGC CCAACCACCC GCCGTCGGTG ACCCTCGACG CCGACCACCA GCAGGCCGCC CCCGGCGAGC CGGTCACGTT CACCGCGAGC TTCGCCGACC CCGACCCCGG TGACTCCGTC GCGCTCGCCT GGTACGTCGA CGGCGACCGG GCCGCCGAGA ACGGGCTCAC GTTGACCCGT TCGTTCGCGG CGACCGGCAC GCACGGCGTC CGGGTCGTCG CGACCGATCT GTCGGGGGCC TCGAGCGAGG CGTCGATCCA GGTGAACGTG GTGGGCACGG CGCCGGAGGC GGCGATCGCC GTACGCACGC CGTCGCCGAA GACCCGCAAG GTGACCGTCC TCGACGCGTC CGGCTCGGTC CCGGCGTCGC CGTCGGGCTC GATCGTCTCC TACCACTGGG ACCTCAACGG CAACGGCACC TTCGAGACCA CCTGCGCCGG ACCGGTCGTC GGCGTGATCA GCGCCGCGGC GGGCGACCAC CCGGTCTCGG TGCTGGTGAC CGACGACAGC GGCGGGACGT CGACGCCGGT CACGACGGTC CTCTCGCTCG CCACCTCGCG GCTCGACCAG GAGTACGCCC CCGGCGCGCT CGCCGTCACC GCGGGCGGCT GCGCGGGACC GGCCAAGGAC GGGGTGGTGC CCGAGGGGTA CCCCGCCGAC GACCTCACCT GCTACACGAC GGTCCGCGCC GGCATCGCCG AGGCGATGTC GACCTGCTTC CGCCGGCGTA CGGCCCCCGT CGGTGACCGG CTGCTCGTGA AGGAGCTCTA CGCCAGCACC AGGACCGTCC GGCTCAACGG CATCGACGTC CGGCCCAGCT CCGGCGTGGC CATCGAGATC GACACCTGGA CCGCGGAGGT CAAGACGATC GGCGGCAAAG CGAAGGCCTC GGTCGACGCC GGCAGCACGC TGGGCAAGCT GACGTTCTAC TACGGCAGCA TCAGCTGGGA CCTGCCCAGC GGCAAGGCGT CGCGCTTCAA CCTCGGCAGC CTGTCGATCA CCAAGGGCGC CGAGCTGTTC GGGCTCTCGG TCGAGGGCGA CGCCCGCCTC GACCTCGTCT ACCGCGGCGC CGAGGTGCCC GTGACGATCC ACCTGCCCGC ACCGCTGGAC GTGTCGGCGA CCGTCACCCT CAAGACCGAC AACCTCAAGG GGCTGCGCCT CGAGGAGGTG CACCTGCGCG TCAAGAACGC GACCTTCGGG GCGTTCACGG TCAACGACCT CGACCTGCTC TACAACGCGG CGGCCTACCA GTTCGACGGC TTCGCGGACC TCTCCCTGAC GTCCGTCGGC AACCTGCAGG TGAGCATCCA GGTCGTCGGC AGGACCGTCA CCATGTTCGC CGCGAACTTC ACGCCCGTGC CCCCGCTGGC GCTCGGGTCG GGGGTGTTCC TCCAGCACAT CGACTTCGGG TACGACGCCG GCCCGCCGCT GACCCTCAAC GGCGGCGTGA AGCTCACCGC CGGACCGCCG ATCAACGGCA CCGCGGCCGC CGCCATCGAG GGGACCTTGA AGTTCGTGGC GTCCGACCCG TGGCTGCTGC GCGCCGACGG CAACGCGTCG ATCGCGGGCT TCGGCGTGGC CAGCGCCTAC CTGCAGTACC AGTCGAACGG CATGATCCGG CTCGGCGGGC GGATCGACGC CCGGCTCTAC GACATCGTCA GCGCCAAGGC CAACATCGAC ATGTGGCTCT ACGCGCCCAC GATGAAGTTC AACGCCCAGG CCCGCGGCGA CGTCTGCGTC TGGAAGGGCT GTGGGGGCGG CGCGCTGGTG GTGTCCAGCA CCGGCGTCGG CGGCTGCGTC TACACGTTCT TCGCGGACTT CGGCCTCGGG TACCGGTGGG ACGGGGAGTT CAAGTACTAC CTGACCGGCT GCGACATCAA CCACTGGGCG TCCGCGTGGG ACGGCGGCGC CCGCAACCGG CTGGCGTCGT ACGCCAGGAC CGCCGCGGTC AGCTCCGAGG ACATGGTCGT CGCCCGCGGC GAGCGGGCCG TCTACTTCCG GTTCGCCTCC GTCGACGACC CGCCGCAGGT CACCGTGACC GGACCCGACG GCACCGAGAT CGCGGTGCCC GCCGGCACCG AGAACTTCGA GGCGACCGAC GACCACATCG TCCTGCAGGT GCCGCCCGAG CACGCGACGT ACGTGATCGT CCGCGACCCC GCGCCCGGAC GCTGGCAGGT GCGCACCGCG GACGGGACGC CCGACCTGGT GGCGGTCGGG CAGGCCTCGG CGCTGCCGAA GCCGCGGGTG CGGGCGAGCG TCGGCGGTCG TGGGCACGCC CGGACCCTGG ACTACCGGCT GAACGCGGTC GAGGGGCAGA CCGTGCAGTT CTTCGAGCGC GCCGGGCGCA CCACCCAGCG GCTGGGCACC GTCTCCGACG CGCAGGGCCG GATCCGGTTC AGCCCGGCGC CGGGGCCGGC GGGCGAGCGG TCGATCATCG CGGTGGTCCA GCACGACGGC GCACCGCGCG AGCAGGTCAC CGTGGCCAGC TACCGGGCTC CCGGGCCGCT GCGCCCGGCC GCGACGCGTC GGATCGCCGT CAAGCGGACC GCGACGCGGG CCGTCGTCAC CTGGGCGGAC GCGCGGCGCG CGGCGAGCTG GCGGGTGGTC GCCACCGCCG ACGACGGGCG CCGGTGGTCG GTGCGGCTGG ACCGCCGCAC GCTGGTCCTG CCCAGCGTGT TCCGCGGCAG GACCGTGACC GTGACGGTGC GCGGGGTCAG CGCGGACCAG GTCGCCGGGC CGGCGAAGCG ACAGGTCTCA CGCGGTCGGT GA
|
Protein sequence | MAAPGETVTF TVSSDASGNP TFVWNVDGLD VQSDSSSLQW AFQGEGQHTV TVTVDDGFDS GTASTSIEVR TPQPNHPPSV TLDADHQQAA PGEPVTFTAS FADPDPGDSV ALAWYVDGDR AAENGLTLTR SFAATGTHGV RVVATDLSGA SSEASIQVNV VGTAPEAAIA VRTPSPKTRK VTVLDASGSV PASPSGSIVS YHWDLNGNGT FETTCAGPVV GVISAAAGDH PVSVLVTDDS GGTSTPVTTV LSLATSRLDQ EYAPGALAVT AGGCAGPAKD GVVPEGYPAD DLTCYTTVRA GIAEAMSTCF RRRTAPVGDR LLVKELYAST RTVRLNGIDV RPSSGVAIEI DTWTAEVKTI GGKAKASVDA GSTLGKLTFY YGSISWDLPS GKASRFNLGS LSITKGAELF GLSVEGDARL DLVYRGAEVP VTIHLPAPLD VSATVTLKTD NLKGLRLEEV HLRVKNATFG AFTVNDLDLL YNAAAYQFDG FADLSLTSVG NLQVSIQVVG RTVTMFAANF TPVPPLALGS GVFLQHIDFG YDAGPPLTLN GGVKLTAGPP INGTAAAAIE GTLKFVASDP WLLRADGNAS IAGFGVASAY LQYQSNGMIR LGGRIDARLY DIVSAKANID MWLYAPTMKF NAQARGDVCV WKGCGGGALV VSSTGVGGCV YTFFADFGLG YRWDGEFKYY LTGCDINHWA SAWDGGARNR LASYARTAAV SSEDMVVARG ERAVYFRFAS VDDPPQVTVT GPDGTEIAVP AGTENFEATD DHIVLQVPPE HATYVIVRDP APGRWQVRTA DGTPDLVAVG QASALPKPRV RASVGGRGHA RTLDYRLNAV EGQTVQFFER AGRTTQRLGT VSDAQGRIRF SPAPGPAGER SIIAVVQHDG APREQVTVAS YRAPGPLRPA ATRRIAVKRT ATRAVVTWAD ARRAASWRVV ATADDGRRWS VRLDRRTLVL PSVFRGRTVT VTVRGVSADQ VAGPAKRQVS RGR
|
| |