Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0806 |
Symbol | |
ID | 4598888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 849369 |
End bp | 850565 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639775409 |
Product | ABC-type sugar transport system periplasmic component-like |
Protein accession | YP_922018 |
Protein GI | 119715053 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGCA AGAAGTGGCT CGCGTCGACA GCCGTCACCC TCGTCGTCTC CCTCGCGCTC GTGGCGTGCG GCTCGGGCGA TTCGGAGGAC TCGAGCACCG GCTCGGGGAG CAACGGTGAC TCGGCGGCGC TCGTCGCCTC GGCTGAGCAG TTGGTCGAAC GCGCCCAGCA GGGCCTGACC TACTCGTCGA CAGACACCCC CGACACTCCT GACCAGATCG AGACCTACGG CGACTGGCGC GGCCCGACGG ACGCACCCGA GATCAAGAAG GGTGCCAACG TCCAGGTGAT CGTGTGCACC AAGCAGGCCG TCGCCTGCAT GGAGGGCGCT CAGGGCGTTG TCGACGCGGG GAAGGCCCTG GGCTGGAAGG TCGAGGTCAT CGACGGCGGC GGAACTCCCG AAGGCAATGC CAAGGCCTTC GACACCGCCT TCAGTCGCAA CCCCGACGCC ATCGTGGGTA TCGCCGTTCC TGCGCTGGCC GTTGGAGACA AGCTCGAGGA GGCCAGGGCG CGCGGGATCA TCACCTCGGT GACCGGGGAC ATCCCGCCGA CCGGGGGGGA GACCGCCTAC GACTCGATCG TGTCCTTCCG GATGACGCTG ATGATGTCGA CGATCGGCTT CGCCGAGATC GCTCGCACCG ACGGCAAGGC CAACTCGATC GTCGTGACGG ACAGCGGCAC GCCGTCTCTT GTCGAGGCAA TGGGCCAGTA CAAGAAGGTT CTCGACACCT GCAGCGGCTG CGACTACACC GACGTCTCGT GGACCATCTC GGACGCGGTC GACCCGACCA AGGTGACCTC GATCGTCACG GGTGCGATCT CGCAGAACCC CAACGCGACC TCGATCGTCG TTCCGTACGC GATCGGCCTG CCGGCGGTCA TCCAGGCGGT CGCCTCGGCC GGGAAGGCCG ACCAGATCAA GGTGATCACC AAGGACGCCG ACGCCGTGGG CCTGCAAGCC GTCGCCGAGG ACCAGGTCAT CTATAACAGC GGGGCTTCGG CGACGTGGGC CGGCTGGGCC GTGGCAGACC AGCTGGTCAG GGGGCTGGCC GGCGCGCCCT ACCTCGATGT CGACGAGATC GGACTCGGGC TGATGTTCTT CATCAAGGAC AACGTGCCGA GCGACGGCAA GATCGACAGC GCCGAGGGAA TGATCGACTA CGCCTCCTTC TACAAGAAGG CGTGGGGTCT GAGCTGA
|
Protein sequence | MRSKKWLAST AVTLVVSLAL VACGSGDSED SSTGSGSNGD SAALVASAEQ LVERAQQGLT YSSTDTPDTP DQIETYGDWR GPTDAPEIKK GANVQVIVCT KQAVACMEGA QGVVDAGKAL GWKVEVIDGG GTPEGNAKAF DTAFSRNPDA IVGIAVPALA VGDKLEEARA RGIITSVTGD IPPTGGETAY DSIVSFRMTL MMSTIGFAEI ARTDGKANSI VVTDSGTPSL VEAMGQYKKV LDTCSGCDYT DVSWTISDAV DPTKVTSIVT GAISQNPNAT SIVVPYAIGL PAVIQAVASA GKADQIKVIT KDADAVGLQA VAEDQVIYNS GASATWAGWA VADQLVRGLA GAPYLDVDEI GLGLMFFIKD NVPSDGKIDS AEGMIDYASF YKKAWGLS
|
| |