Gene Information |
Locus tag | Noca_2607 |
Symbol | |
ID | 4597155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 2773337 |
End bp | 2774536 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 639777213 |
Product | galactokinase |
Protein accession | YP_923798 |
Protein GI | 119716833 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0153] Galactokinase |
TIGRFAM ID | [TIGR00131] galactokinase |
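
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the Gene Length includes the terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp rows of this record.
length = gene_length_bp(2773337, 2774536)
print(length, protein_length_aa(length))  # prints "1200 399"
```

Under these assumptions the result agrees with the Gene Length (1200 bp) and Protein Length (399 aa) rows.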
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00377614 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTTCG TCGAACCAGG TGACCCCCCC GAGCTCGCGG ACCGGGTCCG CGCGGCGTTC GCCGAGGCGT ACGGCGGCCC GGCGGTCGCG GTCGGGCGTG CCCCCGGGCG GGTGAACCTG ATCGGCGAGC ACACCGACTA CAACCACGGC CTGGTGCTGC CGGTCGCGCT CCCCCACGCC ACCTACGCGG CGATGGCGCC GCGCACCGAC GGCCGGATCC GGATCGGCAG CCTGGAGGAG GCCACTCCGT GGAGCGGGTC GCTCGAGGAC GTGGGTCCGG GCTCGGTCGA CGGCTGGGCG GCGTACGCCG CGGGCGTGCT GTGGGCGATG CGCGAGGACG GGTACGGCGT CCCCGGGGTG GACCTGCTGG TCCACGGCAC GGTGCCGCTC GGCGCCGGCC TGTCCAGCTC GGCAGCCCTG GAGTGCTCGG TCGCGCTCGC GGCGTGCGGG CTGCTCGGGG TCGAGCCGGA CGCCGGCGTA CGGCAGCGCC TCGTCGCCGC CTGCATGCGC GCCGAGACCG AGGTGGCCGG GGCCCCGACG GGCGGGATGG ACCAGACGGT GTCCCTCCTG GCGTCGGCGG GATCGGCGCT GCTCATCGAC TTCGACGTCG CGGCGGAGAC GGAGGGCGCG ACCCAGGACG TCGCGCTCGG CCTCGACGCC GCGGGGCTCG CGCTGCTGGT GACCGACACC CGGGTCTCGC ACGCGCTCGT GGACGGCGGG TACGCCGCAC GCCGGGCCGA CTGCGAGGCG GCCGCCGAGG CGCTCGGGGT GCCGTCGCTG CGCCGGGCCT CGCTCACCGA GGTCGAGGGG CTCGACGACG AGCGGGTGCG CCGGCGCGCC CGGCACATCG TCACCGAGAT CGACCGGGTC CGGGCCACGG TCGCCGCCCT CGGCGCCGGC GACTGGCAGG GGGTCGGCCG CGCCTTCCGT GACTCGCACG TGTCGATGCG CGATGACTTC GAGATCTCCT GCCCGGAGCT GGACGTGGCG GTCACCACCG CCGTCGAGGC CGGGGCGATC GGGGCCCGGA TGACCGGCGG CGGCTTCGGC GGCTCCTCCA TCGCGCTCGT CCCGGTCGAG CGCGTCGACG CCGCTGTCCG GGCGATCGAC GCCGCATTCG TCGCCGCCGG CTTCGGGCCG CCGCAGCACC TGCGCGCCGT CCCCTCGAGC GCCGCCGACC TGGTGGACGC ACCCGCCTGA
|
Protein sequence | MAFVEPGDPP ELADRVRAAF AEAYGGPAVA VGRAPGRVNL IGEHTDYNHG LVLPVALPHA TYAAMAPRTD GRIRIGSLEE ATPWSGSLED VGPGSVDGWA AYAAGVLWAM REDGYGVPGV DLLVHGTVPL GAGLSSSAAL ECSVALAACG LLGVEPDAGV RQRLVAACMR AETEVAGAPT GGMDQTVSLL ASAGSALLID FDVAAETEGA TQDVALGLDA AGLALLVTDT RVSHALVDGG YAARRADCEA AAEALGVPSL RRASLTEVEG LDDERVRRRA RHIVTEIDRV RATVAALGAG DWQGVGRAFR DSHVSMRDDF EISCPELDVA VTTAVEAGAI GARMTGGGFG GSSIALVPVE RVDAAVRAID AAFVAAGFGP PQHLRAVPSS AADLVDAPA
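
The relationship between the two sequence rows can be illustrated with a short translation sketch. It assumes the gene sequence is shown in the coding (5' to 3') orientation, as the leading ATG suggests, and it uses the amino acid assignments of NCBI translation table 11, which match the standard code. The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first three and the final 10-base blocks of the record are used here.

```python
# Codon table built in the NCBI ordering (bases T, C, A, G at each position).
# Table 11 shares these amino acid assignments with the standard code; it
# differs in which codons may act as starts, which is not modelled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(grouped_dna: str) -> str:
    """Translate a DNA string that may be split into space-separated blocks."""
    dna = "".join(grouped_dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three 10-base blocks of the gene sequence row.
print(translate("ATGGCCTTCG TCGAACCAGG TGACCCCCCC"))  # prints "MAFVEPGDPP"
# Last nine bases of the final block; "*" marks the stop codon.
print(translate("CCCGCCTGA"))  # prints "PA*"
```

The first output matches the first block of the protein sequence row, and the second matches its final two residues followed by the stop codon, which is not counted in the 399 aa protein length.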