Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_12190 |
Symbol | fruB |
ID | 7760162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1173068 |
End bp | 1175941 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643804122 |
Product | fructose-specific multiphosphoryl transfer protein |
Protein accession | YP_002798421 |
Protein GI | 226943348 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01003] Phosphotransferase System HPr (HPr) Family [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGAGT TGAATGCCGC GCAGATCCGC ATGGGCCGCG CCGCGCCTGA CAAGCTGGCG GCGCTGGCGT TGCTGGCCGA GGGTCTGGTA GACGACGGGT TGGTCGAGCC CGGCTACCTC ACCGGCATGC AGGCCCGCGA AGCCCAGGGC TCGACCTACC TCGGCCAGGG CATCGCGATT CCCCATGGCA CGCCGGAAAC CCGCGACCAG GTGAAGCGTA CCGGCGTGCG TCTGATCCAG TTCCCCGACG GCGTGGATTG GGGCGATGGC CAGCAGGTCT ACCTGGCGAT CGCCGTCGCC GCCCGTTCCG ACGAGCACCT GCACCTGCTG CAACTGCTGA CCCGGGCGCT CGGCGAGGGC GATCTGAGCC AGGCGCTGCG CGAGGCCGGC GAGCCCGAAC AACTGGTCGC GCTGCTGCAG GGCGCGCCGC GCGAACTGGC CCTGGACAGT CAACTGATCG GCCTGGGCGT GGCCGCCGAG GACTTCGACG AACTGGTCTG GCATGGCGCG CGCCTTTTGA AGCGGGCCGG CTGCGTGGCT TCCGGCTTCG CCGGCAGCCT GCTGCAGAGT CAGCCGCTGC CTTTGGGCGA GGGACTCTGG TGGCTCAACA GCGAGCAGGC GGTGGAGCAA CCCGGGCTCG CCTTCGTCAC CCCGGCCCGG CCGCTCGTCC TGCACGGCCA GCCGCTGACC GGGCTGTTCT GCCTGGCCAG CCTGGGCGAA GCGCACCGCC AGTTGCTCGA GCGACTCTGC GACCTGCTGA TCGAGGGGCG CGGCGCCACC TTCAGCCAGG CCACCTGCAG CCGTGCCGTG CTGGAGGCCC TGGGCGCCGA ACTGCCGGAG GACTGGCCGA GCCTGCGCGT CACCCTGGCC AACGCTCACG GGCTGCACGC GCGGCCGGCC AAGGCGCTGG TCGAGGTCGC CCAGTCCTTC GACGGCGAGA TCCGCATCCG TCTGGCCGGC GAGAGCGGCC GCGGGGTGTC GGCCAAGAGC CTGAGCAAGT TGCTGGCGCT CGGCGCGCGG CGCGGCCAGG CACTGGAGTT CAGCGCCGAA CCGGCGATCG CCGCCGATGC CCTGCCGGCG ATCGAGGCGG CCGTGCTGGC CGGCCTCGGC GAGAGCATCG AGCCCCTGGC GTTGCCGGGC GAGACGCCGC CGTCCGAAGA GCCCGGCACC TCCACCTTCG TCCGGCCGCG AGCGCCAGCG GCCGGCACGC GTCTGCAGGC GGTCGCCGCC GCGCCGGGCA TCGCCATCGG TCCGGCGCTG GTGCGCACCC CGCTGGAACT CGACTACCCG CAGCGCGGCC AGGGCATGCT GGTCGAGCTG CAACGCTTGG ACAGCGCCCT GGCGCAGGTC ACCGCGGACA TCCAGCGGCT GATCGACGAC AGCGAGGAAG CCAACGTCCG CGAGATCTTC ATCACCCATC AGGCGATGCT GCGCGATCCG ACGCTGCGCG AGGACGTCAA CGCGCGTCTG GCCGACGGTT TCAGCGCCGA GGCCGCCTGG AGCGTGGAAA CCGAGGCGGT GGCGCAGCAG CAGGAGGCCC TGCACGATGC GCTGCTCGCC GAGCGCGCCG CCGACCTGCG CGACATCGGC CGGCGGGTGC TGGCGCACCT GTGCGGCGTC GAGGCGCCGC GCGAGCCGGA CGAGCCCTAC ATCCTGGTGA TGGACGAGGT GGCGCCCTCC GACGTGGCCA GCCTCAACCG CCTGCGCGTG GCCGGCATTC TCACCGCCCG CGGCGGCGCC ACCGCGCACA GCGCGATCAT CGCCCGCGCC CTGGGCATTC CGGCCATCGT CGGCGCCGGC GAGGCGGTGC TGGCGCTGGC CCAGGGCACC CCGTTGCTGC TCGACGGCGA CCACGGCGTG CTGCGCGTCG CGCCGGACGC GCAGACCCTC GAGCAGGCAC GCCGCGAGCG CGAGGCCAAC CGGCTGCGCC GCGAGCGCGC CCACGCCGAG CGCATGCTGC CGGCGGTGAC CCGCGACGGC CACGCGGTGG AGGTGGCGGC GAACATCGGC GCCAGCGGCG AGAGCGCCGA GGCGGTCGAG CTGGGCGCCG AGGGGGTCGG CCTGCTGCGC ACCGAACTGG TGTTCATGGA CCACGCCCAG GCGCCGGACC GGCACGCCCA GGAAGCCGAG TACCGCCGCG TGCTCGACGG CCTCGGCGGC CGGCCGCTGG TGGTGCGCAC CCTGGACGTC GGCGGCGACA AGCCCTTGCC CTACTGGCCG ATGCCGGCGG AAGAGAACCC CTTCCTCGGC GTGCGCGGCA TCCGCCTGAG CCTGCAGCGC CCGGACATCC TGGAGACCCA GTTGCGCGCG CTGCTGGCCT CGGCCGACGG CCGGCCGCTG CGGATCATGT TCCCGATGGT CGGCGGCGTC GAGGAATGGC GCATCGCCCG CGACCTGGCC CTGCGCCTGC GCGAGGAGAT CCCGGTCGAC GACCTGCAAC TCGGCATCAT GGTCGAGGTG CCCTCGGCGG CGCTGCTGGC CCCGGTGCTG GCCCGCGAGG TGGATTTCTT CAGCATCGGC ACCAACGACC TGACCCAGTA CGCCCTGGCC ATCGACCGTG GCCATCCGAC CCTGTCGGCC CAGGCCGACG GCCTGCATCC CGCCGTCCTG CGCTTGATCG GCATGACCGT CGAGGCGGCC CATGCCGAGG GCAAGTGGGT CGGCGTGTGC GGCGAACTGG CCGGCGACAT GCTCGCCGTG CCGCTGCTGG TCGGCCTCGG GGTGGACGAA CTGAGCGTCT CGGCGCGCTC CATCGCCCTG GTCAAGGCCA GGGTGCGCGA ACTCGACCTC GCCCATAGCC GGGCGCTGGC GCAGCGGGCC CTGGCGCTGG AGAGCGCCGG TGCGGTGCGC GACCTGGTCG GGGAGACGCA CTGA
|
Protein sequence | MLELNAAQIR MGRAAPDKLA ALALLAEGLV DDGLVEPGYL TGMQAREAQG STYLGQGIAI PHGTPETRDQ VKRTGVRLIQ FPDGVDWGDG QQVYLAIAVA ARSDEHLHLL QLLTRALGEG DLSQALREAG EPEQLVALLQ GAPRELALDS QLIGLGVAAE DFDELVWHGA RLLKRAGCVA SGFAGSLLQS QPLPLGEGLW WLNSEQAVEQ PGLAFVTPAR PLVLHGQPLT GLFCLASLGE AHRQLLERLC DLLIEGRGAT FSQATCSRAV LEALGAELPE DWPSLRVTLA NAHGLHARPA KALVEVAQSF DGEIRIRLAG ESGRGVSAKS LSKLLALGAR RGQALEFSAE PAIAADALPA IEAAVLAGLG ESIEPLALPG ETPPSEEPGT STFVRPRAPA AGTRLQAVAA APGIAIGPAL VRTPLELDYP QRGQGMLVEL QRLDSALAQV TADIQRLIDD SEEANVREIF ITHQAMLRDP TLREDVNARL ADGFSAEAAW SVETEAVAQQ QEALHDALLA ERAADLRDIG RRVLAHLCGV EAPREPDEPY ILVMDEVAPS DVASLNRLRV AGILTARGGA TAHSAIIARA LGIPAIVGAG EAVLALAQGT PLLLDGDHGV LRVAPDAQTL EQARREREAN RLRRERAHAE RMLPAVTRDG HAVEVAANIG ASGESAEAVE LGAEGVGLLR TELVFMDHAQ APDRHAQEAE YRRVLDGLGG RPLVVRTLDV GGDKPLPYWP MPAEENPFLG VRGIRLSLQR PDILETQLRA LLASADGRPL RIMFPMVGGV EEWRIARDLA LRLREEIPVD DLQLGIMVEV PSAALLAPVL AREVDFFSIG TNDLTQYALA IDRGHPTLSA QADGLHPAVL RLIGMTVEAA HAEGKWVGVC GELAGDMLAV PLLVGLGVDE LSVSARSIAL VKARVRELDL AHSRALAQRA LALESAGAVR DLVGETH
|
| |