Gene Information |
Locus tag | Avin_46920 |
Symbol | |
ID | 7763555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4763657 |
End bp | 4765474 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643807536 |
Product | sulphate transporter |
Protein accession | YP_002801772 |
Protein GI | 226946699 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | [TIGR00815] high affinity sulphate transporter 1 |
|
|
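The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(4763657, 4765474)
print(length, protein_length(length))  # 1818 605
```

Both values agree with the Gene Length (1818 bp) and Protein Length (605 aa) fields listed in the record.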
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.326682 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATACCT CGCTGCCACG GCGGCTCGCC CGCCACCTGC CGTGCCTGGA ATGGGCACGC CGGTACGACC ACGAAACCGC GGGCAAGGAC GGCCTGGCCG CGCTGATCGT CACCCTGATG CTGATCCCGC AGAGCCTGGC CTACGCCATG CTCGCCGGCC TGCCGCCGGT CACCGGACTG TACGCCAGCA TCCTGCCGCT GGTGGCCTAT ACCCTGTTCG GCACCAGCCG CACCCTGGCG GTCGGTCCGG CGGCCGTGCT CTCGCTGGTC ACCGCCAGCG TGCTCGCTCC GCTGTTCGCC GCCGGCAGCG CCGAGTACAA CGCCGCCGCC CTGCTGCTGG CGCTGCTCTC CGGAATCGTC CTGCTGGCCA TGGCGGCGCT GCGCCTGGGG TTTCTCGCCA ACTTCCTCAG CCATCCGGTG ATTTCCGGTT TCATGAGCGC TTCGGGCATC CTCATCACCC TGGGGCAGCT CAAGCACATC CTCGGCATCG AAGCCGACGG CGAGAACGCC ATCGAGCTGC TCGGGGCCCT GGTCCGGAGC CTGCCACAAA CCAACCTGCC GACCCTGGCC ATCGGCATCG GCAGCCTGTT TTTCCTCCAT CTGGCCCGCT CGCGACTGCA TGGCTGGTTA CTCGCCCGCG GTTTCGGCGC GAAGATTGCC GGCACCCTGG TCCGTACCGG TCCGGTGGTC GCGCTGCTGG CGTCGGTGCT ACTGGTCTGG CTGTTCGGCC TGGATGCCGC CGGGGTACGC GTGGTCGGCC AGACGCCCCA GGGCCTGCCA TCGTTCGCGC TGCCGCCGCT GGATGCGGCG CTGGCCGGCG AACTGCTGCC GGCGGCGCTC CTGATCAGCC TGATCGGCTT CGTCGAATCG GTGTCGGTGG CGCAGACCCT GGCCGCCAGG CGTCGCCAGC GCATCGAGCC GAACCAGGAA CTGGTCGGTC TCGGCGCCGC CAATCTGGCT GCGGCGCTGA GCGGCGGCTT CCCGGTCACC GGCGGTCTGT CGCGCTCGGT GGTGAACTTC GATGCCGGCG CGCAGACGCC CATGGCCGGC GCCCTGAGCG CCGTGGGCAT CACCGTCACC GTGCTGTTCT TCACTCCGCT GTTCCACAAC CTGCCGCATG CCGTGCTGGC GGCCACCATC ATCGTCGCGA TACTGACCCT GGTGGATCTC GGCGCGCTCG GGCGCACCTG GCGCTATTCG CGCCAGGATG CCGCGGCCAT GGCGGCGACC ATGCTCGGCG TGCTGCTGAT CGACGTCGAG GCCGGCATCC TCATCGGGGT CGGCCTGTCC CTGCTGCTGT TCCTCTGGCG TACCAGCCAG CCGCACATCG CCGTGGTCGG CCAGTTGCCG GGCAGCGAAC ACTTTCGCAA CGTCAAACGC TTCGCCGTGG TGGAGAGCCC GAAGGTACTG TCGATCCGCG TCGACGAGAG CCTGTATTTC CCCAACGCCC GCTATCTGGA AGACCGCGTC GCCGAACTGG TCAGCCAGCA TCCCCGGGCC GAACACCTGG TGCTGATGTG CCCGGGGGTC AACCTGATCG ACGCCAGCGC CCTGGAAAGC CTGGAGGAGA TCGGTGCACA CCTGCACGCC GCCGGCATCC AGTTGCATCT CTCCGAGGTC AAGGGGCCGG TGATGGACCG GCTCAGGCAC TCGGACTTTC TCGAACACTT CGGCGGCCGG GTCTTCATCA GCCAGTTCGA GGCCCTGGCC GAACTCGATC CGCAGACCAC CCGGCGCGCC CTCGGCATGC ACCGCCGGGG CCCGGCCGCC CCCCTTCGAC CGACACGGAA GAAAAACGAC 
AGTGAAAAGC GCCCATGA
|
Protein sequence | MHTSLPRRLA RHLPCLEWAR RYDHETAGKD GLAALIVTLM LIPQSLAYAM LAGLPPVTGL YASILPLVAY TLFGTSRTLA VGPAAVLSLV TASVLAPLFA AGSAEYNAAA LLLALLSGIV LLAMAALRLG FLANFLSHPV ISGFMSASGI LITLGQLKHI LGIEADGENA IELLGALVRS LPQTNLPTLA IGIGSLFFLH LARSRLHGWL LARGFGAKIA GTLVRTGPVV ALLASVLLVW LFGLDAAGVR VVGQTPQGLP SFALPPLDAA LAGELLPAAL LISLIGFVES VSVAQTLAAR RRQRIEPNQE LVGLGAANLA AALSGGFPVT GGLSRSVVNF DAGAQTPMAG ALSAVGITVT VLFFTPLFHN LPHAVLAATI IVAILTLVDL GALGRTWRYS RQDAAAMAAT MLGVLLIDVE AGILIGVGLS LLLFLWRTSQ PHIAVVGQLP GSEHFRNVKR FAVVESPKVL SIRVDESLYF PNARYLEDRV AELVSQHPRA EHLVLMCPGV NLIDASALES LEEIGAHLHA AGIQLHLSEV KGPVMDRLRH SDFLEHFGGR VFISQFEALA ELDPQTTRRA LGMHRRGPAA PLRPTRKKND SEKRP
|
| |
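The relationship between the gene sequence and the protein sequence above can be checked with a short script. This is a sketch under stated assumptions rather than the database's own method: it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ only in which start codons are permitted), and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGCATACCT CGCTGCCACG GCGGCTCGCC"))  # MHTSLPRRLA
print(translate("AGTGAAAAGC GCCCATGA"))               # SEKRP
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field; note that a gene starting with an alternative start codon such as GTG or TTG would need its first residue set to M explicitly, which is not required here because this gene starts with ATG.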