Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4082 |
Symbol | |
ID | 4596596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 4310245 |
End bp | 4311825 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639778688 |
Product | extracellular solute-binding protein |
Protein accession | YP_925266 |
Protein GI | 119718301 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.484283 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAGCA CAGCACGTAG TCGCTCGGCA CGAGCCCGGG CCGGGCTCGT AGCGGTCGCC GCAGCGGCCG CCCTCCTCCT CGCCGCGTGC GGGGGAGGCT CGGGTGGCAA CCAGAGCGAC GGGCCGGAGA CCACGAAGAT CACCGACCTG GTGGTCGACA GCGGTCAGGA GCCCGACTCC CTGGACCCCT TCTACCGCAA CACCGCGGAG GCGCAACGCT TCTACCGCCT GGCCTACAGC AGCATCCTGA AGTGGAACGA GGACGGTTCG CTCGCCCCTG ACCTGGCGGC GGAGCTGCCC GAGGTGACCG ACGGCGGCAA GACGTGGACG ATCACGCTCC GCGACGGAGT CACCTTCCAT GACGGCACAC CGCTGACGGC CGACGACGTG GTCTTCACGT TCGAGGCCGC GGCGAACCCC GAGAACGGTG CCGTCTGGCT CTCCTCGCTG AGCTACATGG AATCCGTCAA GGCCGTCGAC GACACCACGG TCGAGCTGAA GCTGACCGAG CCGTACGCCT ACATGGGTAG CCGGCTCGCG ATGATACCGA TCCTGTCGGA CGAGACGCCG TACAAGACCA ACGACACCTA TGCCACAACC GAGAACGGCA GTGGTCCGTA CGTGCTGGAG AAGCTCAACC GCGGCGACTC CATCGAGATG GCGCGGTTCG GAGACTACTT CGGCGACCAG CCGCCCTTCG AGACCATCAC CTTCAAGGTC GTTCCGGAGG ACGCCTCCCG GATCGCTCGC CTGCTCAACG GTGAGTCCCA CATCCTGCCG AACGTACCCA CCGACCAGGT CGAGCTGATC AAGGACCGCG GCGCGAACGC CGCGATCGTC GAGAAGAACG TCGTCCGCCT GTTCCTCTAC CCGTCGATGA ACCCCGACCG GCCGACCTCG AACGTCGACT TCCGGCTTGC GATCGCCTAT GCGGCCGACC GGCAGCGGAT CGTGGACCAG GTGTACGGCG GCGCGGGCCG TCCGAACAGC ACCTACCTGA CCTACGGATC GCTCTACCAC GACGAGGAGG TCGGGATGAC CTTCGGTTCG ACGCCCGACA TCGAGGCCGC GAAGGAGCAC CTCGAGGCGT CGGGCTACGA CACGAGCCGC ACCCTGAAGA TCATCGCGGT GAACAAGCCG AGCGTGGTGA GAGCGATGAC GATCCTGCAG GCCAACCTCA AGGCGATCGG CGTGACCGCC ACCGTCGAGT CGCAGGAGGT GGCCGGCTTC TACTCGGCGC TCATCTCCGG GGAGTACGAC CTGATCGCCT TCGACAGCCC GGCGTCGACG TCGGCGGGCT TCGCTCCCGA CTACGTCAAC GGTGGCCTGA ACAGCAAGGC GGCGAACAAC TTCGCGAAGT TCAACGACCC GGAGATGGAT CGGCTCCTCG ACACGGCCAT GACGGCCCAG ACCGAGGAGG AGCAGGCCGC GGCCTGGAAG GCGGTCCAGG AGCGTGACGT CGCGACGCAG GGCAACATCC AGCTCGTCGC GGCTCAGGTC AGCGAGGCAT GGTCCAAGGA CCTGGTGGGC TACGAGCCCT CCGGGCTCCT GTGGCTGAAC ACCGTGCTCG ACGTCAAGTA G
|
Protein sequence | MTSTARSRSA RARAGLVAVA AAAALLLAAC GGGSGGNQSD GPETTKITDL VVDSGQEPDS LDPFYRNTAE AQRFYRLAYS SILKWNEDGS LAPDLAAELP EVTDGGKTWT ITLRDGVTFH DGTPLTADDV VFTFEAAANP ENGAVWLSSL SYMESVKAVD DTTVELKLTE PYAYMGSRLA MIPILSDETP YKTNDTYATT ENGSGPYVLE KLNRGDSIEM ARFGDYFGDQ PPFETITFKV VPEDASRIAR LLNGESHILP NVPTDQVELI KDRGANAAIV EKNVVRLFLY PSMNPDRPTS NVDFRLAIAY AADRQRIVDQ VYGGAGRPNS TYLTYGSLYH DEEVGMTFGS TPDIEAAKEH LEASGYDTSR TLKIIAVNKP SVVRAMTILQ ANLKAIGVTA TVESQEVAGF YSALISGEYD LIAFDSPAST SAGFAPDYVN GGLNSKAANN FAKFNDPEMD RLLDTAMTAQ TEEEQAAAWK AVQERDVATQ GNIQLVAAQV SEAWSKDLVG YEPSGLLWLN TVLDVK
|
| |