Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_01050 |
Symbol | |
ID | 7759072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 106074 |
End bp | 107645 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643803031 |
Product | sulfate transporter |
Protein accession | YP_002797347 |
Protein GI | 226942274 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.176613 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTCGACT CGCACACGGA AGAAACCTCC ATGAACCTCG AAACCCTGAA ATCCACCCTG CCACGGGACG CCATGGCCTC CGTGGTGGTC TTTCTCGTCG CCCTGCCACT GTGCATGGGC GTCGCCATCG CCTCCGGCAT GCCGCCGGCC AAGGGTCTGA TCACCGGCAT CATCGGCGGG CTGGTGGTCG GCTGGCTGGC CGGCGCGCCG CTGCAGGTCA GCGGCCCGGC GGCGGGCCTC GCCGTGCTGG TCTTCGAACT GGTGCGCCAG CACGGCATCG CCATGCTCGG GCCGATCCTG CTGCTCGCCG GCCTGATCCA ACTGCTCGCC GGACGCCTGC GCCTGGGCTG CTGGTTCCGC GTCACCGCGC CGGCGGTGGT CTACGGCATG CTGGCGGGGA TCGGCATCCT GATCATCCTG TCGCAGTTAC ACGTGATGCT CGACGCTTCG CCGCAGGCTT CCGGCCTGAC CAACCTGCAG GCCTTCCCGG CGGCGCTGCT CGACGCCCTG CCGTTCGGCG AAGGCAGCGG CTGGAGCGCC GGGCTGCTCG GTCTCCTGAC CATCGGCACC ATGTGGCTGT GGGACAAGCG CAAGCCGGCC AGACTGCGTT TCCTGCCCGG CGCGCTGCTC GGCGTGAGCC TGGCGACCCT GGTCAGCCTG GTCTTCTCGA TGGACGTGCA CCGGGTCGAG GTGCCGGCCA ACCTGGGCGA TGCCATCGAC TGGGTACAGC CGGCCGACCT GTTGCGCCTG CTGACCGAGC CCAGCCTGCT GATCGCGGCG CTGACCCTGG CCTTCATCGC CAGCGCCGAG ACCCTGCTGT CGGCCGCCGC GGTGGACCGC ATGCACCAGG GCCCGCGCGC CGACTTCGAT CGCGAGCTGA CTTCCCAAGG CATCGGCAAC ATGCTCTGCG GCCTGCTCGG CGCCCTGCCG ATGACCGGGG TGATCGTGCG CAGCTCGGCC AACGTGCAGG CCGGCGCCGT CACCCGCCTG TCCGCCGTCC TCCACGGCCT CTGGTTGCTG GCTTTCGTCC TGCTGCTGAC CGCGGTGCTG CAGAGCATTC CGGTGGCCAG CCTGGCCGGG GTGCTGGTCT ACACCGGGGT CAAGCTGGTC GACCTCAAGG CCCTGCGCGG CCTGGGCCGC TACGGCCGCA TGCCGATGTT CGTCTACGCC GCCACGGCCC TGGCGATCGT CTGCACCGAC CTGCTGACCG GGGTGATGAT CGGCTTCGCC CTGACCCTGG CCAAGCTCGC CTGGCGCGCC TCGCGGCTGA AGATCAGCCT GGTCTACGAC GAGGACGGAC GCACCGCGGA GCTGCGCATG GTCGGCTCGG CGACCTTCCT CAAGGTGCCT GAACTGTCCC GCGTGCTGGC CACGGTGCGT CCCGGTACCC AGTTGCACGT ACCGCTGGAT CACCTGAACT ACATCGACCA CGCCTGCATG GAGCTGCTCG ACGAGTGGAG CGCCGCCAGC GCGGCCAACG GCTCCGAGCT GGTGCTCGAG CCCCGCGCGG TGAAACGCCG CCTGGAAGGC CGGCTGCGCA CCACGGTGGG CATCGGCGGC GCGACCGCCT GA
|
Protein sequence | MFDSHTEETS MNLETLKSTL PRDAMASVVV FLVALPLCMG VAIASGMPPA KGLITGIIGG LVVGWLAGAP LQVSGPAAGL AVLVFELVRQ HGIAMLGPIL LLAGLIQLLA GRLRLGCWFR VTAPAVVYGM LAGIGILIIL SQLHVMLDAS PQASGLTNLQ AFPAALLDAL PFGEGSGWSA GLLGLLTIGT MWLWDKRKPA RLRFLPGALL GVSLATLVSL VFSMDVHRVE VPANLGDAID WVQPADLLRL LTEPSLLIAA LTLAFIASAE TLLSAAAVDR MHQGPRADFD RELTSQGIGN MLCGLLGALP MTGVIVRSSA NVQAGAVTRL SAVLHGLWLL AFVLLLTAVL QSIPVASLAG VLVYTGVKLV DLKALRGLGR YGRMPMFVYA ATALAIVCTD LLTGVMIGFA LTLAKLAWRA SRLKISLVYD EDGRTAELRM VGSATFLKVP ELSRVLATVR PGTQLHVPLD HLNYIDHACM ELLDEWSAAS AANGSELVLE PRAVKRRLEG RLRTTVGIGG ATA
|
| |