Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1037 |
Symbol | |
ID | 4599698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 1091016 |
End bp | 1093673 |
Gene Length | 2658 bp |
Protein Length | 885 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639775636 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_922243 |
Protein GI | 119715278 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGGAC GACCAAGTAT GGCCTCCAGT TTGAGCAAGT GGCGACAAGC ATTCGAGAAA CTCGATCAGA CGCCCTGGCC GGGCACGAAT CGAATCGACG ATCCGACCCA GTTGCGTGGC CGCAAGCGCG ACGTTTCCGA TATCGCTGTG GCCTGCCTAG CAAATGACCT CCTTGTCATC CACGGAGCGT CCGGGGTCGG AAAGTCCTCG CTCCTCACAG CTGGGCTCAT TCCTGAGTTG AGGCGACGAC GGAAGACCGT CGTGTACTGC AATCGCTGGG ATGCTCCTGA TGACAGCGTG GCTCCGTCTG CGCACATCAC TGAAGGTGTG CTGGAGGCGG ACCCGTTCTC ACTCCCGAAC GGCGAGTTTG AATCTAGATT CGGTGATCGA CTGGTGATCG TACTGGACCA GTTCGAAGAA GTGATTCGAA ACAATCCCGA GTTCGCGCAA CGCGTGCTTC GCTGGATCGA AGACGTAGTC GGCACGACCT CGGCCAGATT CGTCGTTTCG CTTCGGTCTG AGCAGGAGCA CGAGCTCGCC GGGCTGTATA CCAAGCCGTT CGCGCGTCGC GGACGCGTCG AGATCCCGGC AATTACGAAC CCAAGGATCA TCGAACTCAT CATCGGGGGC CCCCGTGACT CGTCCACGGA GCAGAGCACC GAAGGACGCC GACTTCCGAT CGCGGACGAC GCGATCGATG CTCTTCGAGA GGCCTGGCAA GCATCACAGG CGGATGAGAA CTCAACGAAG TGGGATCGGC CCGGTCTCCT GCACCTCCAA GCCGCTCTCT ATGTCCTCTG GATGCGTAGA TCTGCAAAGT CCGGCGGCGA CGATGGCCTT GGGCACATCG CGCTCACCGA TGTCACGGGC CTCATCCGCG AAGTAGCCAA GAGGCATGGT CTCGGCCGCG CGTCTGCACA GGCGGCACTG CTCGCATATG CGCTTGAGCT GAGCGTGTCG TGGAAGGTCG AGAACTGCGA GTCGGCGTGC ATGAGCACGC GGACCTGGCA GGGGGTTCCG GCTGCAATCG TCAATCAGAC CAAATGGATC TTCCGCGACA TCACCGAGCA TCTGTCGAGC GGCGGTTACA AGACTCCGCG AGACATGTGG GAACTCTCGA GAGAGGTCAT CGGGCACCTT CACCGGCCTG CGCATGCTGC CGTTCAGCCG GTCGCTCAGA AGCTCTACAA CGGACTCGAT CCCGAATGGC TTAGTGCACG CGAGGTCGTG CAGACGGAGG CATCGTCGGG CGAAGGAAAT CAGTACGGCC TGCAGCCAGA CTGGCTTGGC GCTGAACGCC CATCCTTCAG CGCCGACATG CGCGACGGCG GACCGTCGCC GCGCCAACTT GGTTCTGGGC CCGCGGCCGG CCTCACAAAC ACCGATGTGA TGTTCGAACT CTTCCGGTGC TATTTCTTCG CGCTCGAATG GCTGAAGCAC GCGAAGATCG CACAGTTAGA AAGCAAGCGC GACAGCAAGA TCGTAATTCT GACGCACGAT CGCTACTCGG CCGGACTTGC CCGCTGGCAC GGAAGCCAGG TCGGCAGCTT CAAGGAAGCC GTGGAACGTC TTGCCTCCCA CCGCGGCGAG GATCTGGCAT GGCACGATGT GGGCGGAAAG GTGCGTTCTG CAGCCGCGAC CCGACTAGTT GTGAACGCCA ACTGGCGCTC GTGCGCCATC CACGACACAG ACTTCTCGGG CGTGACTTTC GTGAACTGCG ACTTTGCAGG TTCAACATTC GAGAGTTGCG TCTTCGACGG AGCGACATTC GTCAACTGCA TCCTCGACCA GGTCGATTTC GTGCGATGCG CCATCAAGGG GCGCCCGACC TGGCCCGAGA AAGCAGTGCT CGACCGGCTT GCCGACGAGG CTGTCGCGAA GGCGCCCGAG TTCAGACTGG CTGCCCCGAG CGAGCTGACT GACGCGCTTC GCGCGCTGCA GGCTCCTAGT TCGACGCGCA TAACCTCAAC GACTCACTTC CACCTGTATG CAAGGGAGTC TGGCACCCCC GCAGTCACTG CGTCGGGAAG GGCGCCAGCG CCAAAACCCA CCAGGGAGCC GACGCTTCCG CTCAAGCCCG GCGGTCTCAC TGTCTGCGGC GGCCGGCTAA GTTCGCTCAC GTTTCGCACA TGCGACTTCC TTGGACCCAA CGCGACGGTC AGCCTCCACC ACATCGCGGG GACTTCTTTG GAGATTTGCG AACAGCGCGT TGGCACGTTC GACATCTTCG CAGCGGGTAT CCGAGGCCTA ACGGTCACCC GCCCCGTGGA GGACCTCGAT GAAGCCGCAC CTAGTGGCGC CTCCTCCAAC CGCGGCGGAC CACGACAGTT CACGCTCAAT GTTCATCGTG CCCGGGTAAT CAATGCCTGG TTCGGTGTAA ACCTCAAGGG CAAGGCCTCA TTCGATGACT GCCTCATTCT CCAACTCGTC AACGCAAGCG AGTCCTTTAC GCCAACGCTC TTGCGATCAA GATACTTCGG CTTGGTCAAC GCTGAGACTC CCAAGGATTT GCTGGTGGAA CCCAAGGGCT CAATCGAGAT CGCGGATGCG GGTATCGAGA CTCTGGGTGG TTTGGCGCCT GAGCTGGTGG GCCTGAGCAG AAACATCGAC TTCCGGGAGT CAGTGCCCGA CCTCGCCGTG GACTCGAAGG AGGAGTGA
|
Protein sequence | MTGRPSMASS LSKWRQAFEK LDQTPWPGTN RIDDPTQLRG RKRDVSDIAV ACLANDLLVI HGASGVGKSS LLTAGLIPEL RRRRKTVVYC NRWDAPDDSV APSAHITEGV LEADPFSLPN GEFESRFGDR LVIVLDQFEE VIRNNPEFAQ RVLRWIEDVV GTTSARFVVS LRSEQEHELA GLYTKPFARR GRVEIPAITN PRIIELIIGG PRDSSTEQST EGRRLPIADD AIDALREAWQ ASQADENSTK WDRPGLLHLQ AALYVLWMRR SAKSGGDDGL GHIALTDVTG LIREVAKRHG LGRASAQAAL LAYALELSVS WKVENCESAC MSTRTWQGVP AAIVNQTKWI FRDITEHLSS GGYKTPRDMW ELSREVIGHL HRPAHAAVQP VAQKLYNGLD PEWLSAREVV QTEASSGEGN QYGLQPDWLG AERPSFSADM RDGGPSPRQL GSGPAAGLTN TDVMFELFRC YFFALEWLKH AKIAQLESKR DSKIVILTHD RYSAGLARWH GSQVGSFKEA VERLASHRGE DLAWHDVGGK VRSAAATRLV VNANWRSCAI HDTDFSGVTF VNCDFAGSTF ESCVFDGATF VNCILDQVDF VRCAIKGRPT WPEKAVLDRL ADEAVAKAPE FRLAAPSELT DALRALQAPS STRITSTTHF HLYARESGTP AVTASGRAPA PKPTREPTLP LKPGGLTVCG GRLSSLTFRT CDFLGPNATV SLHHIAGTSL EICEQRVGTF DIFAAGIRGL TVTRPVEDLD EAAPSGASSN RGGPRQFTLN VHRARVINAW FGVNLKGKAS FDDCLILQLV NASESFTPTL LRSRYFGLVN AETPKDLLVE PKGSIEIADA GIETLGGLAP ELVGLSRNID FRESVPDLAV DSKEE
|
| |