Gene Information |
Locus tag | Avin_30540 |
Symbol | cysP |
ID | 7761954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3163324 |
End bp | 3164322 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643805930 |
Product | Sulfate ABC transporter-binding component-CysP-like protein |
Protein accession | YP_002800194 |
Protein GI | 226945121 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
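The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; both assumptions are consistent with the values listed, but the record does not state them. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3163324, 3164322)
print(length, protein_length_aa(length))  # prints "999 332"
```

Both results agree with the Gene Length and Protein Length fields.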
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGAT TGCTGACCTC TTCCCTGCTG GCGGCCGGCG TGGCCCTGGC CTCCGTCGCC AGCCAGGCGG CCCCGTTACT GAACGTCTCC TACGACGTGA TGCGCGACTT CTACAAGGAG TACAACCCGG CCTTCCAGAA ACACTGGCAG GCCGAGGGCA ACCCACCGGT ACAGATCCAG ATGTCCCACG GCGGCTCCAG CAAGCAGGCG CGCGCGGTGA TCGACGGCCT GCCGGCCGAC GTCATCACCA TGAACATGGC CACCGACATC AATGCCCTGT ACGACCACGG CAAGCTGATT CCGCAGAACT GGGCCGAGCG CCTGCCGGAC AACAGCGCCC CCTTCACCTC GGCGACCGTG TTCATCGTCC GCAAAGGCAA CCCGAAACAG CTCAAGGACT GGCCCGACCT GCTCAAGGAG GGCGTGCAGG TGGTGGTGCC CAATCCCAAG ACCTCGGGCA ACGGCCGCTA CACCTACCTG TCGGCCTGGA GCTACGCACT GAAGAACGGT GGCGACGACA AGGCCGCGCG CGACTTCGTC GGCAAGCTGT TCAAGCAGGC GCCGGTGCTC GACACCGGCG GTCGCGCCGC TACCACTACC TTCATGCAGA ACCAGATCGG CGACGTGCTG GTGACCTTCG AGAACGAGGC GGAAATGATC GCCCGCGAAT TCGGCCGCGG CGGCTTCGAG GTGGTCTATC CCAGCATCTC CGCCCAGGCC GAACCGCCGG TGGCGGTGGT CGACAAGGTG GTCGACAAGA AGGGCACCCG CAAGGAGGCC GAGGCCTACC TGAAATACCT ATGGTCCGAC GAGGGCCAGC GCATCGCCGC CAACAACTAC CTGCGCCCGC GCAATCCGAA GATCCTCGCC GAATTCTCCG ACCGCTTCCC CAAGGTCGAA TTGCTCGACG TGGTGAAGAC CTTCGGCGAC TGGCCGACCA TCCAGAAGAC CCACTTCAAC GACGGCGGCG TGTTCGACCA GGTCTACGGC GGACGCTGA
|
Protein sequence | MKRLLTSSLL AAGVALASVA SQAAPLLNVS YDVMRDFYKE YNPAFQKHWQ AEGNPPVQIQ MSHGGSSKQA RAVIDGLPAD VITMNMATDI NALYDHGKLI PQNWAERLPD NSAPFTSATV FIVRKGNPKQ LKDWPDLLKE GVQVVVPNPK TSGNGRYTYL SAWSYALKNG GDDKAARDFV GKLFKQAPVL DTGGRAATTT FMQNQIGDVL VTFENEAEMI AREFGRGGFE VVYPSISAQA EPPVAVVDKV VDKKGTRKEA EAYLKYLWSD EGQRIAANNY LRPRNPKILA EFSDRFPKVE LLDVVKTFGD WPTIQKTHFN DGGVFDQVYG GR
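The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to join such blocks into a contiguous string, compute a GC percentage, and check for a stop codon under translation table 11 (TAA, TAG, TGA). The helper names are hypothetical, and only the first two and last two blocks of the gene sequence are used as sample input, so the printed GC value describes that short sample and not the whole gene.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(display_seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(display_seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First two and last two blocks of the gene sequence in this record.
head = join_blocks("ATGAAGCGAT TGCTGACCTC")
tail = join_blocks("GGTCTACGGC GGACGCTGA")

print(len(head), gc_percent(head))  # prints "20 50.0"
print(ends_with_stop(tail))         # prints "True"
```

Applying `join_blocks` to the full gene sequence before calling `gc_percent` would give a value directly comparable with the GC content field.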