Gene Information |
Locus tag | Avin_21910 |
Symbol | asfC |
ID | 7761109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2187847 |
End bp | 2188836 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643805076 |
Product | ABC transporter, substrate-binding protein, aliphatic sulphonate |
Protein accession | YP_002799357 |
Protein GI | 226944284 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
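The coordinate and length fields in this table are mutually consistent, which a little arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS, assuming one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of the record.
length_bp = gene_length(2187847, 2188836)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 990 329
```

Both results match the Gene Length and Protein Length rows.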
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.58658 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACG CTAGCCGTCT CTGTCTCACC CTGGCCGGGC TGCTGTCCTG CTCGGGAATC GCCTGGGCGC AGAACCTGCA AGCCCTGCGG GTGGCCAATC AGAAATCCGG CATCAAGCTG CTCCTGGAGG CGGCCGGGGA ACTGCAAGAG GTGCCCTACG CCATCCAGTT CTCCGAATTT CCGGCGGCCG CGCCGCTGGG CGAGGCGCTG AACGCCGGCG CGGTGGATGT CGGCGGCCTG GGCGACGCGC CCTACGTTTT CGCCCTGGGC AGCGGTGCAG CGCTGAAGGT CGTCTGCATC GTCCATGCGG CCGGCCGCCT GAGCACGGCG ATCATCGTGC CCAAGGACTC GCCCCTGCAC GGCGTCGCCG ACCTGAAGGG TAAGCGCATC GTCACCGGAC GCGGCTCGAT CGGGCATTTC CTGGCGCTCA AGGCCCTGCG CGAGGCGGGA CTGCAAAGCA GCGACGTACG CTTCGTCAAC CTGCTGCCCA GCGACGCGCG CAGCGTCCTG GAGAGCGGCG GCGCCGACGC CTGGTCGACC TGGGACCCGT ACACCGCCAT CGCCATCACC CAGGGCGCCC GGGTGCTGGT CAACGGCAGC CACCTGCTCA GCAACAACTT CTATCTGGCG GCGACCGCCC AGGCCATCGA GGACAAACGC CCGCAACTCA CGGACTTCGT GAAGCGGCTG GAGCGCGCCT ATCGATGGGC CAACCAGCAT CCGGACGCCT ACGCCGCCGC CCAGTCCAGG GTCACCGGCC TGTCCCGCGA GACGCACCTG GAGTCGGCCA GGAATACCCG TTTCCAGCGG GTCCCGATCG ACGATGCGCT GATCGAGGGT CTGCAGGCGA CCGCCGACCT GTATTTCGAG GAAGGCATCA CCGGCAAGCG AATCGAGGTT TCGCAGGGCT TCGACAGGAG TTTCAACGAG GCGGCCGACG GACCGCTTCT ACCCGCCCCG TCCCGGGTCC AGGCCTCCGG CCGCCCATGA
|
Protein sequence | MKNASRLCLT LAGLLSCSGI AWAQNLQALR VANQKSGIKL LLEAAGELQE VPYAIQFSEF PAAAPLGEAL NAGAVDVGGL GDAPYVFALG SGAALKVVCI VHAAGRLSTA IIVPKDSPLH GVADLKGKRI VTGRGSIGHF LALKALREAG LQSSDVRFVN LLPSDARSVL ESGGADAWST WDPYTAIAIT QGARVLVNGS HLLSNNFYLA ATAQAIEDKR PQLTDFVKRL ERAYRWANQH PDAYAAAQSR VTGLSRETHL ESARNTRFQR VPIDDALIEG LQATADLYFE EGITGKRIEV SQGFDRSFNE AADGPLLPAP SRVQASGRP
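The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG) and relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in which start codons are permitted. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
print(translate("ATGAAAAACG CTAGCCGTCT CTGTCTCACC"))  # MKNASRLCLT
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function is the natural way to check the remaining residues.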