Gene Information |
Locus tag | Noca_4737 |
Symbol | |
ID | 4595472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008697 |
Strand | + |
Start bp | 35200 |
End bp | 37470 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639772526 |
Product | hypothetical protein |
Protein accession | YP_919186 |
Protein GI | 119714044 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
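The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 35200
END_BP = 37470


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS.

    Assumes the final codon is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "2271 756"
```

Under these assumptions the computed values agree with the listed Gene Length (2271 bp) and Protein Length (756 aa).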
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0599704 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTGCCGG TGACTGTACC GGCGTCGCGT GGCGATGCCG CACACCTTGA TGCAGGTCGG CGTGTCTCGC CGGCGCTGTT GGCGCTGGAT GAGCACCTGT TCAACCCTGA CGCGTCGCAG TGGCTGGTGA TCGACTCCGG CAACACGGCT GTTCCCCGGA CCGTGGTGCC GTCGCTGGTC GACGGGCTCG AGCTGATCGG CAAGGGCCGT ATCGCCGCCC TGGTCGGGCA CCTGCGCGAC GGCGTCGGCG TGGTCGACGT CGACGTGCCC GGCGAGTTCG GTGACTTCCT CGCGGCCGAG GTCGCCGACT GGCTGTCCCG CCGCGACTGC TGGGTGCTGG AGCGGCCCTC CGGCGGCGCG AAGGGTCGCT GGCACATCTT CTTCGCCCAC TCCGACTTCC ACTACGCCCC CGCCGCTGCC CGGGCCGGGT TCGCGGCTGC GGTCAGCGGC TTCCTCACAG CCCTGGCCGA GGACGTGAAG GTCCCGCGCG GTGAGCTCGA CCTGCGCGAC GCCGTGCGTC CCCTTTCCTC ACCTCACCGG TTCGGTGCGG TGACCAGGCC GAAGGGCGAG CTCCGTGAGG CGCTGCGCGG CCTGAAGCGA GTGCTCCCGG ACCCGCCCTC GCCGTCGCCG CTGCGGCCCC GCGCCAAGGT GAAGCCCACC AACGCTGGCC ATAAGTCCAC TGCGACCGGC TCGGGCCTGG TGGTGCCGTT GGCGTTGCAG CGGTGGAAGC GCCAGCTGCG CACTGAGTGG CGCAATTACC TGCTGACCGG GGAAATCCCG GCCGGGTCGT GGGCCGCGGG CGCGACGAAG ACCCGCGCCG CGGTCGAGGT CGACCGATCC CTCGTCGAGG CGGCCTGCAC CCGGGAGATG GTGTGGGCGA TCGGCGACCC GGAGATGGCC TGGCGGATCA TCCGTGAGTC GCATCCGACC GCGATGACCA AGGCCAAGCA CCAGGGCTAC TCGTGGTGGC TCGGCTACGT GTGGAACGAC CTCGTCCGCT CGGCCAGCGA GTTCAACACC ACCAGCGAGA AGCCCAGGCA GGTCGAGGCG CCACCGGTCG AGGTCGTCGA GGCGGTAGCG GCTGCCCGCG CCGAGCTAGG TCGTCTGATG TGGTCTGTCC CGGACCGCCA GCGGGCGGCG CTGCTGCTTG TCGGGCACCA CCTCCTTGAC CGCGTGCTGC GCAAGCGAAA CCTGCGTGTG CCCTGCCCCG AGCGGGACCT GCTGCTCGAC ACCGGGCTGG GCGACCGCAA GACCGTCCGC GCCGCCCTGG CCCGCCTCAA CGGCCGCCTC GGGACTCTCC ACACCGACTG CCTCTCCCCC GTCGAGCGGG ACTCCACGAG CTACGAGTTT GAAATCAACC AGGCCCCGGA AGGGGAGGGA CGGCAAATCC CCCCACCTGG GTTTGACCCA CCCCCCGCCC CCCGCGGCCT CTGGGCGACC CTCCCCCGCT CCAGCCACAG CCTCTGGCGC ACGCTCCTGA CCTGCTCGAC CCCGCTGGAG CTGGGGGATC TGGTGGTGAA GGCAGGGCTG GTGAAGGCTG CAGGTGATGA GGTGTCGAAG TCTCAGAGGT CCACGGCGAA GGCGGCGCTG GTCGCGCTGA GCAAGGCTGG GATGGTTCGG GTTGACGAGA ATGGATGCTG GCAGGCAGCC ACGCGGCCCC GGTCGGTTCA GGTCGAGCAG GACGCGGCCG CGGCGTACGC CCGTCAACTG GAGACGATCG AGGCAGAACG AGCCGCCTAC CGCGCCGGCA CGACGTCGAG CTGGACCGCA GGCCGGGCGC GCGCCATCAA GGCACAGAGG 
GCCAAGGAGA AGGCCTGGTG GGACAACCTC TCCCCCGCCG CCCGCGCCGA ACGTGCCGCA GCGAAGCGCC TCGAGTTCGA CCAGATGTCG ATCAGCCAGC AGGCCGCGCT GAAGTCCCGT CTCGCCGAGC GACGCATGCG CGCTGGTATC GACGAGCTCG AGACCTACCA GACCTGGCTA CGTAGCCTCC CGGCCGACGA GTACGTCGCC CGCAGCCTGG AGCGCAAACA ACGTTTCCAG GCCCTCTCCC CCGCCGAACG AGGCGCCTCC GTCGCCGCCT GGGATCGCCA CCGGCTCCGC TACGGCCTCA CCGCCCAGCG CCTCGCCACA CCCGCACTCG ACGCGCGGAC AGCGACCCCT GACGTCGAGC AGGCAGCACT CCTGCCCGAC GGAGTCGCCG CACGCGATGC AACTTTCCTC GAGCGCCAGG GCAACCTTCT CGACGACGTC GAACGCCAGG CCGCGGGCTA G
|
Protein sequence | MLPVTVPASR GDAAHLDAGR RVSPALLALD EHLFNPDASQ WLVIDSGNTA VPRTVVPSLV DGLELIGKGR IAALVGHLRD GVGVVDVDVP GEFGDFLAAE VADWLSRRDC WVLERPSGGA KGRWHIFFAH SDFHYAPAAA RAGFAAAVSG FLTALAEDVK VPRGELDLRD AVRPLSSPHR FGAVTRPKGE LREALRGLKR VLPDPPSPSP LRPRAKVKPT NAGHKSTATG SGLVVPLALQ RWKRQLRTEW RNYLLTGEIP AGSWAAGATK TRAAVEVDRS LVEAACTREM VWAIGDPEMA WRIIRESHPT AMTKAKHQGY SWWLGYVWND LVRSASEFNT TSEKPRQVEA PPVEVVEAVA AARAELGRLM WSVPDRQRAA LLLVGHHLLD RVLRKRNLRV PCPERDLLLD TGLGDRKTVR AALARLNGRL GTLHTDCLSP VERDSTSYEF EINQAPEGEG RQIPPPGFDP PPAPRGLWAT LPRSSHSLWR TLLTCSTPLE LGDLVVKAGL VKAAGDEVSK SQRSTAKAAL VALSKAGMVR VDENGCWQAA TRPRSVQVEQ DAAAAYARQL ETIEAERAAY RAGTTSSWTA GRARAIKAQR AKEKAWWDNL SPAARAERAA AKRLEFDQMS ISQQAALKSR LAERRMRAGI DELETYQTWL RSLPADEYVA RSLERKQRFQ ALSPAERGAS VAAWDRHRLR YGLTAQRLAT PALDARTATP DVEQAALLPD GVAARDATFL ERQGNLLDDV ERQAAG
|
| |
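The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted string could be cleaned up and its GC fraction computed, for comparison with the GC content field. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence, as displayed in the record.
sample = "TTGTTGCCGG TGACTGTACC GGCGTCGCGT"
seq = clean_sequence(sample)
print(len(seq), round(gc_content(seq), 3))
```

Applying the same two functions to the full Gene sequence field gives a value that can be compared with the reported GC content; the sample here covers only the first 30 bases, so its result will differ from the gene-wide figure.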