Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1807 |
Symbol | |
ID | 4597644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 1920745 |
End bp | 1922211 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639776406 |
Product | hypothetical protein |
Protein accession | YP_923006 |
Protein GI | 119716041 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.932272 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCCCGG ACGGACGAGT GACCCACATC GAGCCGGACG TGTTCGTGCG TTACCTCGAG CGCACCCGGT GGTTCGGCGG CAAGGGCCGC CCCTTCGAGG TCGCCTCCGT GCGCCGGATC GGCGAGGTGC CCCGCGACGA GGAGGACGGC GGCCCCCGTG TCGTCATCGA GCTGATCGAG GTCGCCTACA GCGACGGCCC GGGCGGCACC GAGGTCTACC AGGTCCCGAT GTCGTTCTAC ACCGAGCCGG AGAGCCGGCT CGACCACGCG TTCATCGGCT GGTGGGACGA GCCCGGCTAC GGCTGGGTGC ACGCGTACGA CGCCCTCCAC GACCGCGACG CGATGGACGG CTGGCTGCGT GCCTTCGACC GGGCGGCCCG CGAGCCGGGC GGCAACCTCA GCGACGACGA CAGCGGGCTG CGGTTCCACC GCCTGCCCGG GCACGACCTC GACCTCGACG CGCACTCGAC GCTGTTCTCC GGTGAGCAGT CGAACTCCTC GGTGGCGTTC GGCGAGGACT GCCTGATGAA GGTGTTCCGC AAGATCACCC CCGGGGTGAA CCCGGACATC AGCGTCCACG AGGTGCTCAC CAACGCCGGT TCCGACCACA TCGCGGCGCT GTACGGCTGG CTGGACTGGG TCGACTGGGA GGCCGACCCC GAGGCCGAGG AGCGCGCCGG CACGACGATG CAGCTGGCCA TGCTCCAGCA GTTCCTGCGG ACCGCGAGCG ACGGCTGGGA CCTGGCGCTG ACGAGCGCCC GCGACCTGTT CGCCGAGGCC GACCTGCATG CCCACGAGGC CGGCGGAGAC TTCGCCGCCG AGGCGGCCCG GCTGGGGACC GCGCTGCGCG AGGTCCACGA GGACCTCGCC GAGCACTTCC CGGTCGAGCG CCGCGGCCCC GAGGCGCTCA CCGAGCTGGC CGACGCGATG TCGGCGCGGC TCGACGCCGC GCTCGAGGTG GTGCCCGAGC TCGCCGCGCA CACCGAGACG CTACGGGCGA CGTACGACCG CGTCCGTGGA CTCGGCGGAC TGGAGGTCCA GCAGATCCAC GGGGACCTGC ACCTGGGCCA GACGCTGCGC ACCAGCCTGG GCTGGAAGAT CGTCGACTTC GAGGGCGAGC CGGCCAAGCC GCTCGCCGAG CGGCTGCGAC CGGACTCGGT GTGGCGCGAC GTCGCGGGCA TGCTGCGCTC CTTCGACTAT GTGCCCCGGG TGGTCGAGCG CCAGTTCGCC GAGGACCAGC CCGAGGGCGC CAGCCAGCGC GCCTACCGCG CGGAGGAGTG GGCGCACCGC AACCGCAACC ACTTCCTGAC CGCGTACGCC GGCGGCGAGC TCACCGAGGA GCAGCAGGCG CTGCTCGACG CCTATGTCGT GGACAAGGCG GTGTACGAGA CCGTGTACGA GACACGAAAC CGTCCGACCT GGGTGGCCAT CCCGCTCGAG GCCGTGGCGA GGATCGGAGC GGCATGA
|
Protein sequence | MTPDGRVTHI EPDVFVRYLE RTRWFGGKGR PFEVASVRRI GEVPRDEEDG GPRVVIELIE VAYSDGPGGT EVYQVPMSFY TEPESRLDHA FIGWWDEPGY GWVHAYDALH DRDAMDGWLR AFDRAAREPG GNLSDDDSGL RFHRLPGHDL DLDAHSTLFS GEQSNSSVAF GEDCLMKVFR KITPGVNPDI SVHEVLTNAG SDHIAALYGW LDWVDWEADP EAEERAGTTM QLAMLQQFLR TASDGWDLAL TSARDLFAEA DLHAHEAGGD FAAEAARLGT ALREVHEDLA EHFPVERRGP EALTELADAM SARLDAALEV VPELAAHTET LRATYDRVRG LGGLEVQQIH GDLHLGQTLR TSLGWKIVDF EGEPAKPLAE RLRPDSVWRD VAGMLRSFDY VPRVVERQFA EDQPEGASQR AYRAEEWAHR NRNHFLTAYA GGELTEEQQA LLDAYVVDKA VYETVYETRN RPTWVAIPLE AVARIGAA
|
| |