Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_31720 |
Symbol | |
ID | 7762072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3280793 |
End bp | 3281791 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643806046 |
Product | Sulfate-binding precursor protein |
Protein accession | YP_002800310 |
Protein GI | 226945237 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCATCC GTCGTTTCGC CCTCGCCGCC CTGGCCGGCC TGAGCCTGAG CGCTGCCGCC CAGGCGCAGA CCCTGCTGCT CAACGTGTCC TACGACCCGA CCCGCGAGTT GTACCGGGAA TACAACGCCG CCTTCAACAA GCACTGGCAG GCCGAGGGCC ATGAGCCCGT GACCATCCAG CAGTCCCATG GCGGCTCGGG CAAGCAGGCG CGCGCGGTGA TCGACGGACT CAAGGCCGAC GTGGTGACCC TGGCCCTGGC CGGCGATATA GATGAATTGC ACAAGCTCGG CAAGCTGATT CCCGAGGACT GGCAGAGCCG CCTGCCGCAG GCCAGCACAC CCTACACCTC GACCATCGTA TTCCTGGTGC GCAAGGGCAA TCCGAAAGGC ATCAAGGACT GGGGCGACCT GGTCAAGCCG GGCGTGGAAG TGATCACCCC GAATCCGAAG ACCTCCGGCG GCGCACGCTG GAACTTCCTC GCCGCCTGGG CCTGGGCACA GCAGCAGTAC GGTAGCGAGG ACAAGGCCCG CGCCTACGTC GAACAGCTCT TCAAGCAGGT TCCGGTGCTG GATACCGGAG CGCGCGGCTC GACCATCACC TTCGTCAACA ATAAAATCGG CGACGTCCTG CTGGCCTGGG AAAACGAGGC CTTCCTGGCC CTGAAGGAAC AGGGTGGGGA AAACCTCGAG ATCGTCGTGC CTTCGCTGTC GATCCTCGCC GAACCGCCGG TGGCGGTGGT GGACAAGAAC GTCGACCGCA AGGGTACCCG CGAACTGGCC ACCGCCTACC TGAACTATCT GTACAGCGAG GAAGGCCAGC GCATCGCCGC GAAGAATTTC TACCGTCCGC GCAACGAGAA GGTCGCCACC GAATTCGCCA AGCAGTTCCC CAACCTCAAG CTGGTGACCA TCGACAAGGA TTTCGGTGGC TGGAAAACCG CCCAGCCGAA GTTCTTCAAC GATGGCGGGG TGTTCGATCA GATCTACAAG GCGCACTGA
|
Protein sequence | MSIRRFALAA LAGLSLSAAA QAQTLLLNVS YDPTRELYRE YNAAFNKHWQ AEGHEPVTIQ QSHGGSGKQA RAVIDGLKAD VVTLALAGDI DELHKLGKLI PEDWQSRLPQ ASTPYTSTIV FLVRKGNPKG IKDWGDLVKP GVEVITPNPK TSGGARWNFL AAWAWAQQQY GSEDKARAYV EQLFKQVPVL DTGARGSTIT FVNNKIGDVL LAWENEAFLA LKEQGGENLE IVVPSLSILA EPPVAVVDKN VDRKGTRELA TAYLNYLYSE EGQRIAAKNF YRPRNEKVAT EFAKQFPNLK LVTIDKDFGG WKTAQPKFFN DGGVFDQIYK AH
|
| |