Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_50220 |
Symbol | |
ID | 7763873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 5090803 |
End bp | 5091942 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643807853 |
Product | LacI regulatory protein |
Protein accession | YP_002802087 |
Protein GI | 226947014 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTATTGG GGAGGGAATT GGGCTGCTAT AGTCCGGCCG CGTTATATGT CAGAACATCT CAAGGACAGC AGAAGAGCTT TCATTTCATG CCTGAACCCG TCAGCCCTCC GCCCCGGCGT CCTACCCTGC AGGATGTCGC CCGCGCCGCC GGCGTGAGCC TGTCGACCGT CGACCGGGTG ATCAACCTGC GCGCCAACGT GCGCGCCGAC ACCGCGCGGC GCATCGCCGA GGCGGCGGCG CGGCTGGGCT TCCATGGCCG CGGCGTCATC GAGCAGCGCC TGCTGGAACA GCGCCGGACC GTGCGCCTCG GCTTCCTGCT GTGGAAACGC AACACGGCCT TCTACCGCGG GCTGGCCGAC GGACTGGCCG AAGCCGCCGC GGCCTGCGTC CGGGCCCAGG TGCGGGTGAC GATCGGCTAC CTGGACAACC TGGACCCGGA GGCGACCGCG GCGCGCATGC TGGCCCTGCG CGGCGAGGTG GACGCGCTGG GCATAGTCGC CGCCGACCAC CCGGCCGTGC GCGAGGCGGC GACCGCCCTG CGCGCGGCGA GCCTGCCGGT GGCGGCGCTG GTTTCCGAGC TGGCCGCCTC GACCGGGGCG GGCTACGTCG GCCTGGACAA CCGGCGCGTG GGCCGCACGG CCGCCTGGTT CGTCGCGCAG CTCGCCCGCC AGCCGGGGGC CGTGGCGCTC ATGGTCGGCA GCCAGCGCTT CCAGTGCCAG GAACTCTGCG AACTGAGCTT CCGCAGTTTC CTGCGCGAGC ACTCCGAGGA CTGGGAACTG CCGGCGCCGC GCCTGACCCT GGAGGACGAA ACCTTCGCCT ACGAGAACAC CCTGGATCTG CTCAGCGGCC AGCCCGACCT GGCCGGGCTC TACGTCGGCG GCGGCGGGAT CGAGGGCGTC CTGCGCGCCC TGCGCGAGCA GCGCGGCGCC CGGCCGGTGG TGGTCTGCCA CGACCTGACG CCGGTGACGC GGGCCGCCCT GAACGATGGG ATCGTGCAGG TCGTGCTCTC GCATCCGCTG GCCGAGCTGG CGCGCTCGGC GGTGGACCTG CTGGTCGAGG CCGCGACAGG CCGGGAAGCG ACGTCCGCGC TGCCCAAGCG CATCCTGCCC ATCCAGATCG ATATTCCCGA AAGCGTCTGA
|
Protein sequence | MLLGRELGCY SPAALYVRTS QGQQKSFHFM PEPVSPPPRR PTLQDVARAA GVSLSTVDRV INLRANVRAD TARRIAEAAA RLGFHGRGVI EQRLLEQRRT VRLGFLLWKR NTAFYRGLAD GLAEAAAACV RAQVRVTIGY LDNLDPEATA ARMLALRGEV DALGIVAADH PAVREAATAL RAASLPVAAL VSELAASTGA GYVGLDNRRV GRTAAWFVAQ LARQPGAVAL MVGSQRFQCQ ELCELSFRSF LREHSEDWEL PAPRLTLEDE TFAYENTLDL LSGQPDLAGL YVGGGGIEGV LRALREQRGA RPVVVCHDLT PVTRAALNDG IVQVVLSHPL AELARSAVDL LVEAATGREA TSALPKRILP IQIDIPESV
|
| |