Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_0068 |
Symbol | |
ID | 4600119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 76344 |
End bp | 77522 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639774679 |
Product | inulin fructotransferase (DFA-I-forming) |
Protein accession | YP_921301 |
Protein GI | 119714336 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACAA CCGTTTACGA CGTCACCACC TGGACCGGCG CCACCGTCTC GCCTTACACC GATATCGGCC TGGTCATCAA CCAGATCATC GCTGACATCA AGAGCCAGCA GAACAGTCAG ACGACCCGCC CTGGCGCCGT CATCTACATT CCTCCCGGTC ACTACACCCT GCAGACGACA GCCAACATCG ACATTGGCTT CCTGACGATC AAGGGCTCCG GGCATGGCTT CATGTCCGAA GCAATCCGCG ACGACGTGAA CCACTCGGCC TGGGTCGAGA CTCTCCCGGG CTCAAGCCAT GTGCAGATCG CGAACAACAA CCAGGTCGGG TTCCTCGTCA ACCGGGCCAC CGACCCAGGG ACGAACGGCC GACTCAACTC GATCGTCTTT CAGGACTTCT GCATCGACGG CGTCAGCAGT ACCAAGCCCT ACATTCCTGG AAACGGAAAG ATCGGCATCA AGTCGCAGTA CGACACCGAC TCCCTTCGAA TCGAGGGGAT GGGCTTCGTC TACCTCAACA CAGCACTGAC CATCCGCAAT GCCGACGCCT TCAACATCAC CAACAACTTC ATCGCTGAGT GCGGCAGCTC CATCCAGCTG ACAGACAGCT CCATCGTCGG AAAGATCACC AACAACTACC TGATTAGCGC GTGGGCTGGG AACTCCATCT TCATCGAGAA CAACGAGGAC TGTGTCATCA GCGGGAACAG TCTCCTGTGG GGCGCTCGGA TCCAGATGAA GAATGTCCAT CGCGCAGTGA TTACTGGCAA CAAGTTCGTC AGCAACTTCT CCGGAATGAT CGTTCACGAA ACTCCGTGCC ACGAGCAGCT GATCTCGGGT AACCACTTCC GTCGCAAGTA CGGCGACGGT GGCCCTGCCC GCAATGACGA TCTCTTCGGC ATGGTCCATC TCAACGGCAA CGACAACTCC GTCACAGCCA ATCACTTCGC GTTCGAGGTG CCGGCCGCCA ACATCGTCCC CTCCGGAGCC ACCCCCACGG TCGTCCTAGT CAAAGGTGGG GCCCGCAACT TCCTCGCCAC AAACAAGTTG GCGTCAAACG TCGCGGTACG CCACGTGCTC GACGCGAGTT CCACGGCCAC GAAGGTTCTC TGGTCGGGCA CGGCGGCCCA ACTCCAAGAT CTGAGCGGGG GAAACATGTC CTTCGTGGCA ACGCCGTGA
|
Protein sequence | MATTVYDVTT WTGATVSPYT DIGLVINQII ADIKSQQNSQ TTRPGAVIYI PPGHYTLQTT ANIDIGFLTI KGSGHGFMSE AIRDDVNHSA WVETLPGSSH VQIANNNQVG FLVNRATDPG TNGRLNSIVF QDFCIDGVSS TKPYIPGNGK IGIKSQYDTD SLRIEGMGFV YLNTALTIRN ADAFNITNNF IAECGSSIQL TDSSIVGKIT NNYLISAWAG NSIFIENNED CVISGNSLLW GARIQMKNVH RAVITGNKFV SNFSGMIVHE TPCHEQLISG NHFRRKYGDG GPARNDDLFG MVHLNGNDNS VTANHFAFEV PAANIVPSGA TPTVVLVKGG ARNFLATNKL ASNVAVRHVL DASSTATKVL WSGTAAQLQD LSGGNMSFVA TP
|
| |