Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0404 |
Symbol | |
ID | 4597716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 434682 |
End bp | 435788 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639775019 |
Product | putative sugar ABC transporter, substrate-binding protein |
Protein accession | YP_921634 |
Protein GI | 119714669 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGATGA ACAAACTCAT CGTGGCCGGT GTCGCCGCGG TGTCTGCCTT GAGCCTCGCT GCCTGTAGCC AGGGAAGCGG CGCCACCACT GCGGAGCCGA AGGCCCAGGA CTCCGCCTCC GTGCAGGCGG CGAGCGGGGA CGCACCTGCG CCGTTCGACC AGGCCGGGGT CCGCGTGGCC ATCGTGCAGA ACAGCGGCCA GGGTGACTAC TTCCAGCAGT ACCTGAACGG CACGAAGCAG CAGGCGGCAG CGCTCGGGAT CGACCTGAGC GTGTACGACG CCCAGGGCGA CAACGCCACG CAGGCGACCC AGCTCGATCA GGCCATCTCA TCCGGCGTGC AGGGCATCAT CGTGCGCCAC GGCTTCCCCG ACACGCTCTG CCCGGGCGTC AACAAGGCGA TCGACCAGGG CATCAAGGTC GTCATCTACG ACGTCGAGAT CCAGAAGTGC GCGCCGCAGG CCGTGCAGAC CCAGCAGTCC GACAACAAGA TGGCGAGCCT CGTGCTGGAC AAGATGGCCG AGGACATCGG CACCGGCAAG CCGGTCGGCT ACGTCAACGT CGCCGGTATC GCGCCGCTGG ACCGCCGGGA CCTCGTGTGG CAGGACTACC TGGCCACGAA CGACTGGACC CAGAAGTTCA AGACCGGCAA GTTCACGAAC TCCACGGCCA CGGACACCGC GCCGATGGTG GACAGCGTCT TGAAGTCCAA CCCGGACGTG GTCGCCGTCT ACGCGCCGTA CGACGAGCTC ACCAAGGCGA CCCTCTCCGC GCTGAAGCAG AATCCGAGCC TGCAGGGCAA GGTCAAGGTC TACGGCGCCG ACATCTCCAC GGCCGACATC GAGCTCATGA CCAACCCGGA CAGCCCGTGG GTGGCGACCG GCGCGACGGA CCCCAACGCC ATCGGTGCTG CGGTGGTCCG CACGCTCGCA CTCCACATGG CCGGCGAGCT GGACGGGCTC AACGTCGAGT TCCCGCCGAT CCTGGTCACG CAGGACTTCC TGCGGTCGGA GGGGATCAAG AACATGGATG ATCTGCGTGC AGCGGAGCCG GCGCTGAACA TCGCCGACGT GTCGTCCGCC GACTGGATCC CGGCCGTCAC GTTCTGA
|
Protein sequence | MRMNKLIVAG VAAVSALSLA ACSQGSGATT AEPKAQDSAS VQAASGDAPA PFDQAGVRVA IVQNSGQGDY FQQYLNGTKQ QAAALGIDLS VYDAQGDNAT QATQLDQAIS SGVQGIIVRH GFPDTLCPGV NKAIDQGIKV VIYDVEIQKC APQAVQTQQS DNKMASLVLD KMAEDIGTGK PVGYVNVAGI APLDRRDLVW QDYLATNDWT QKFKTGKFTN STATDTAPMV DSVLKSNPDV VAVYAPYDEL TKATLSALKQ NPSLQGKVKV YGADISTADI ELMTNPDSPW VATGATDPNA IGAAVVRTLA LHMAGELDGL NVEFPPILVT QDFLRSEGIK NMDDLRAAEP ALNIADVSSA DWIPAVTF
|
| |