Gene Information |
Locus tag | Avin_51300 |
Symbol | |
ID | 7763970 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5212337 |
End bp | 5213659 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643807950 |
Product | major facilitator superfamily (MFS) permease |
Protein accession | YP_002802184 |
Protein GI | 226947111 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCACG CGACACCGGA CGACAGTCTC CATGCCGGTA CCATGGCCCG GCTCAACCTG AAGCTCATTC CCTTCCTGAT GCTGCTGTAT CTGATCGCCT ATATCGACAA GGCGAACATC TCGGTGGCGG CCCTGCAGAT GAATGCCGAT CTCGGACTGA CCGCCCGGAT GTACGGCATA GGGGTCGGGC TGTTCTTCGT GACTTATATC CTGCTGGAGG TGCCCAGCAA CCTGATCCTC TCCCGCGTCG GTGCGCGCCG CTGGATCGCC CGCATCATGA TCACCTGGGG CTTGGTGGCC GCCGGCATGA GCCTGGTACG GAGCGCCGAC CAACTGTACG TCATGCGCCT GCTGCTGGGC GCGGCGGAGG CCGGCTTCAC GCCAGGCATC ATCTATTACC TGGCGCAGTG GTATCCGCGC AGCGACCGGG CCCGCGCGAT GTCGTTCTTC TATATCGGCG CGGCGCTGGC CTCGGTGATC GGCCTGCCGC TGTCGGGAGC CTTCCTGCAT CTGGACGGGC TGCTGGGCAT CGCCGGCTGG CGCTGGCTGT TCCTGCTCGA AGGTCTGCCG GCCGCCGTGC TCGGCGTGGT GGTATTGCGC TATCTGCCGG AGTCGCCCGA GCGCACCCAC TGGCTCGACG CTGCCCAGAA GCGCTGGCTG ACCCGCACCC TGGCGGCCGA GCGGGAGACC ACGGCGATTT CCCATCACGA CGCCTGGCAT GTGGCTTTCC GCAGCCGGCA GGTCTGGCTG CTCAGCCTGT TCTGGCTGCT GCAGGCCTTC GGCACCATCG GCATCACCCT GTTCCTGCCG CTGATCGTGC AGTCGGTATC GGGGCAGGGC GCCTTCACGG TCAGTCTGCT GTCGGCCTTG CCCTTCCTCT TCGCCTGCCT GTTCATGTAC TTGAACGGTC GTCGTTCCGA TCTTTCCGGC GAGCGTGGCC TGCACCTGGG CGGCCCGCTG CTGGGCGCCG GTATCCTGCT CGGGACCGCG GTGATTACCG ACAGCCAGCC GCTGGCCTAC GCGATGCTGG TGTTCACCGT CGCCCTGAAC TGGGCGGCGA CCCCGGTGTT CTGGGCGACC ACCACCGAGT ACCTGTCCGG GCCGGCGGCG GCGGTGTCGA TCGCGCTCAT CAACGCCATC GCCAATATCG CCGGCCTGGG GCTGCCGCCG GTCATGGGCT GGATCAAGGA CACCACCCAC AGCTACGACT ACGCCCTGCT GCTGGTGGCC TGCGCCCTGC TGGCCGGCGG TCTGCTGGGA CTCCACCTGG GCGGCGGGCA GGCCCGTCGC CCGGTCCGTG ATCATGTTTC GAGGAGCCAT TGA
|
Protein sequence | MTHATPDDSL HAGTMARLNL KLIPFLMLLY LIAYIDKANI SVAALQMNAD LGLTARMYGI GVGLFFVTYI LLEVPSNLIL SRVGARRWIA RIMITWGLVA AGMSLVRSAD QLYVMRLLLG AAEAGFTPGI IYYLAQWYPR SDRARAMSFF YIGAALASVI GLPLSGAFLH LDGLLGIAGW RWLFLLEGLP AAVLGVVVLR YLPESPERTH WLDAAQKRWL TRTLAAERET TAISHHDAWH VAFRSRQVWL LSLFWLLQAF GTIGITLFLP LIVQSVSGQG AFTVSLLSAL PFLFACLFMY LNGRRSDLSG ERGLHLGGPL LGAGILLGTA VITDSQPLAY AMLVFTVALN WAATPVFWAT TTEYLSGPAA AVSIALINAI ANIAGLGLPP VMGWIKDTTH SYDYALLLVA CALLAGGLLG LHLGGGQARR PVRDHVSRSH
|
| |
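The coordinate and length fields in the Gene Information table can be cross-checked against one another. The sketch below is illustrative only and is not part of the source record: the helper names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon, which is consistent with the values listed above.

```python
# Minimal consistency check for the record's numeric fields.
# Assumptions (not stated in the record itself): coordinates are
# 1-based and inclusive, and the CDS length counts the stop codon.


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; spaces between 10-base blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(5212337, 5213659)
    print(length)                  # 1323, matching "Gene Length"
    print(protein_length(length))  # 440, matching "Protein Length"
```

Passing the full Gene sequence string to `gc_percent` gives a value that can be compared with the GC content field; the function strips the spaces that separate the 10-base blocks before counting.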