Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1530 |
Symbol | |
ID | 4595681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 1622315 |
End bp | 1625329 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639776128 |
Product | hypothetical protein |
Protein accession | YP_922731 |
Protein GI | 119715766 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAGC TGTTCGACGA GGCCCCTCGG GACCCCGGAC CCCCGGCGCG GTCCGGTTCG CGCCGGTCCC GTGCCCTGAT CGTCACGGCC GTGGTGCTGG TGATCGGCTT CCTGGGGCTG AGCACGTTCG CGGGCATCTA CACCGACCGG CTCTGGTACG TCTCGGGCGG GTACGGCGCG GTCTTCACCA CGCTGTTCTG GACCAAGACC GTGCTGTTCT TCCTCTTCGG CGCGGGCATG GCGCTGGTGG TCGGCGTGAA CATCTACCTG GCCTACCGGT TCCGGCCGTT CTTCCGCCCG AACTCACCGG AGCAGAACGG GCTGGACCGC TACCGCGAGG CCATCAACCC GATCCGGACC TGGCTGCTGG TGGGCGTCGC GCTCGTGCTC GGCGCGTTCG CCGGCAGCTC GGCGATCGGC GAGTGGCGCG ACTACCTGCT GTGGCGCAAC GGCACGTCGT TCGGCAGCGA GGACGCCTAC TTCCAGAAGG ACATCGGCTT CTACGTCTTC GACCTCCCGT GGCTGCACTA CCTGGTCGAC TACGCGATGG CCGTCCTGGT CGTCGCGCTG ATCGCCGCCG CGGTCGTGCA CTACCTGTAC GGCGGGATCC GGCTGCAGAC GCCCCGCGAC CGGCTCTCCG GGGCCGCCCA GGCGCAGATC TCGGTGCTGC TCGGGTTCTT CGTGCTCGCG AAGGCCGCCG ACTACTGGCT GGACCGCTTC GACCTGGTCA GCCAGGGCGG CGGCGTGATC ACCGGCATGA CCTACACCGA CGACCACGCG GTGCTGCCGG CCAAGAACAT CCTCCTCGGC ATCTCGATCA TCTGCGCCGT GCTCTTCTTC GTGAACGTGT GGCGCCGCAC CTGGCTGCTG CCCTCGGTCG GCCTGGCCCT GCTCGCCGTC TCGGCGATCC TGCTGGGGCT GATCTGGCCG GGCATCGTGC AGCAGTTCCA GGTCAAGCCC TCCGAGGCGG ACAAGGAGGC GCCGTACATC GAGAAGAACA TCGAGGCGAC CCGCACCGCC TACGACGTCG CCAACGTCGA CGTGGAGAAG TACGACCCGG CGACCGCGCT CGGCGCCGGC TCCGCGAGCA TGGTCGAGGA GGAGACCTCC TCGGTGCCGC TGGTCGACCC GCAGCTGGTC CGGGACGCCT TCGAGCAGAA CCAGCAGGTG CGGGCCTACT ACTCGGTCGC CCAGGTCCTC GACGTGGACC GCTACGACAT CGACGGCAAC GACCGGGCGC TCGTGCTCGG GGTCCGGGAG CTCGACCAGA GCGGCATGAA CGCCGGCGAC CGCAACTGGA CCAACCTGCA CACCGTCTAC ACCCACGGCA ACGGCATCAT CGCGGCGTTC GCCAACCAGC GCAGCGAGGA CAACAAGACC CAGATCGACA ACGCGGACAA CACCGGTGAC CAGGCCGGCA TCGTGTGGGC CCAGGGCACC AACGCCGGGC AGGACGCCCT CGCTCGTGCC ACCGGCGGCT TCGAGGACCG GATCTACTAC GGAGAGCAGA GCCCGCAGTA CTCCGTGGTC GGCAAGGCGA CGCCGGACTC CACCGACGTC GAGCTGAACC TGCAGACGGC CGGGTCCGAC GAGGGCTCGA CGACGACGTA CGACGGCAAC GGCGACGCCA GCGTCGGCGG GTTCTTCAAC CAGCTGATGT TCGCCACCAA GTTCGGCGAG CCGAACTTCC TGCTCTCGGG GCGCGTGAAC CCCAACAGCA AGGTGCTGTT CAACCGCAAC CCGGCCGACC GGGTCGAGAA GGTGGCGCCC TGGCTGACCG TGGACAGCGA CCCCTACCCG GCCGTGGTCG ACGGCCGGAT CCTGTGGATC ATCGACGGCT ACACCACCAC CGACCGCTAC CCGCTGTCGG AGAAGGAGTC GTTCCAGACG ATGATCGACG ACTCCCTGCA GGAGGAGACC GGGCTGCGCA CCCTGCCGAC CGACGAGATC AACTACATGC GCAACGCCGT GAAGGCCACC GTCGACGCCT ACACCGGTGA CGTCACGCTC TACGCCTGGG ACGAGGAGGA CCCGATCCTG CAGGCCTGGC GCAGCGCGTT CCCCGGCACC GTCGAGGACA AGTCGGAGAT CTCCGACGAT CTGCTCGACC ACCTGCGCTA CCCCGAGGAC CTGTTCCAGG TGCAGCGCTA CCAGTTCGCC CGCTACCACG TGACCGAGCC GATCGACTTC TACCAGGGCA ACAACCGCTG GCAGGTGCCC GAGGATCCCT ATTCAAAGGG CAAGTTCCAG CCGCCCTACC GGCTCTTCGT CGACAGCAAC GGCGGCACCG ACCAGGTGTT CGCACTGACC TCGGTCTACG TCCCCTACAA CAAGAACAAC CTCGCGTCGT TCGTCTCGGT GAACGCGGAT GCGACCAGCG ACCAGTACGG CCAGATGCAG GTGCTCGAGC TGCCCAACGA GCAGACGCCG GGCCCCGGCC AGGTCGCCAA CCAGTTCGCC ACCGACCCGG AGGTCGCCAA CGAGCTGGCC CAGTTCAACC GCAGTGGCGC GCGGCCGGTG TACGGCAACC TGCTGACGCT GCCGATCAAC GACGGGCTGA TGTACGTCCA GCCGGTGTAT GCGACCCAGG CCCTCTCGGA CTCGAGCTTC CCGATCCTGC GCTACGTGCT GGTGAAGTAC GGCAACGACA TCGGCTTCGG CTCGACGCTG CGCGACGCCC TGGAGAACCT CCTCGGCGTC AGCACCGGCC CCGGCACCCA GCCCCCGGAC ACCGGCCAGC CCGGCGACAA CGAGAACCCG CCGCCCGCCA CCGGCACCGT CGCCGCGCAG ATCCGCGCCC TCCTCGCCCA GGCCCAGGAC GCCTTCGACG CCGCCGACGC GGCGCTGGCC GACGGGAACC TCGCCGAGTA CCAGCGCCAG ATCGGCATCG CCCAGGCCAA CGTCGAGGCC GCCATGGAGC TCGGCCAGAA GCGCGGCTCG GCCGGTCAGC CGTCGGGCTC GCCCTCGGGG TCCGCGTCGT CCTCCCCCTC GGAGTCGCCG AGCCCGTCCT CCTGA
|
Protein sequence | MSELFDEAPR DPGPPARSGS RRSRALIVTA VVLVIGFLGL STFAGIYTDR LWYVSGGYGA VFTTLFWTKT VLFFLFGAGM ALVVGVNIYL AYRFRPFFRP NSPEQNGLDR YREAINPIRT WLLVGVALVL GAFAGSSAIG EWRDYLLWRN GTSFGSEDAY FQKDIGFYVF DLPWLHYLVD YAMAVLVVAL IAAAVVHYLY GGIRLQTPRD RLSGAAQAQI SVLLGFFVLA KAADYWLDRF DLVSQGGGVI TGMTYTDDHA VLPAKNILLG ISIICAVLFF VNVWRRTWLL PSVGLALLAV SAILLGLIWP GIVQQFQVKP SEADKEAPYI EKNIEATRTA YDVANVDVEK YDPATALGAG SASMVEEETS SVPLVDPQLV RDAFEQNQQV RAYYSVAQVL DVDRYDIDGN DRALVLGVRE LDQSGMNAGD RNWTNLHTVY THGNGIIAAF ANQRSEDNKT QIDNADNTGD QAGIVWAQGT NAGQDALARA TGGFEDRIYY GEQSPQYSVV GKATPDSTDV ELNLQTAGSD EGSTTTYDGN GDASVGGFFN QLMFATKFGE PNFLLSGRVN PNSKVLFNRN PADRVEKVAP WLTVDSDPYP AVVDGRILWI IDGYTTTDRY PLSEKESFQT MIDDSLQEET GLRTLPTDEI NYMRNAVKAT VDAYTGDVTL YAWDEEDPIL QAWRSAFPGT VEDKSEISDD LLDHLRYPED LFQVQRYQFA RYHVTEPIDF YQGNNRWQVP EDPYSKGKFQ PPYRLFVDSN GGTDQVFALT SVYVPYNKNN LASFVSVNAD ATSDQYGQMQ VLELPNEQTP GPGQVANQFA TDPEVANELA QFNRSGARPV YGNLLTLPIN DGLMYVQPVY ATQALSDSSF PILRYVLVKY GNDIGFGSTL RDALENLLGV STGPGTQPPD TGQPGDNENP PPATGTVAAQ IRALLAQAQD AFDAADAALA DGNLAEYQRQ IGIAQANVEA AMELGQKRGS AGQPSGSPSG SASSSPSESP SPSS
|
| |