Gene Information |
Locus tag | Avin_03760 |
Symbol | |
ID | 7759336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 355983 |
End bp | 356972 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643803300 |
Product | ABC transporter, substrate-binding protein, aliphatic sulphonate |
Protein accession | YP_002797611 |
Protein GI | 226942538 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
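The Start bp, End bp, Gene Length and Protein Length fields above are linked by simple arithmetic, which the following sketch checks. It assumes 1-based inclusive coordinates and a stop codon that is counted in the gene length but not in the protein length; both are assumptions about this record's conventions rather than something the record states. The function name `cds_lengths` is illustrative only.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with a stop codon that does not contribute a residue.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Coordinates taken from the Start bp / End bp fields of this record.
print(cds_lengths(355983, 356972))  # (990, 329)
```

Under these assumptions the result agrees with the listed Gene Length (990 bp) and Protein Length (329 aa).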
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCAAC CTTCGGACGG CCTGTCGCGC CGCCGCTTTC TCGCCGGCAG CGCCGGCGCC CTGGCCCTGT CGCCGCTGCT GCTCCACGGC CGGCGCGCCG GGGCGGCCAC GCCGGGCCGG CTGCGCGTGG CCCAGTACAA GGGCGGCGAC AAGCTGCTGC TGGAGGCCGC CGGCCTCGCC GACACGCCCT ACCCCATCGA CTGGGCGGAG TTCGCCTCGG GCAACCTGAT GGTCGAGGCG ATGAACGGCG GTTCGCTGGA TCTCGCCTAC GGCAGCGAGA TCCCGCCGCT GTTCGGCTAC CTCAAGGGCG CGCGCATCCG CGTGGTCGGA GTGATCAAGG GCGACGTCAA CGAACAGACG GTGCTGGTGC CGAAGGATTC GCCGATCCGC TCCATCGCCG ATCTGAAGGG CAAGCGCGTC GGCTACGTGC GCGCCACCAC CACCCAGTAC TACCTGACCA AGATGCTCGA CGAGGTCGGC CTGAGCTTCG CCGACATCCA GGCGATCAAC CTCACGGTGC CCGACGGCGC CGCCGCCTTC CGCACCGGCC AGCTCGACGC CTGGGCCATC TACGGCTATT CGGTGCCGCT GGCGCAGACC TCGGTCGGCG CCCGGGTGCT CAAGCGCGCC AACGGCTACC TGTCGGGCAA CTATCTGTTC TTCGCTGCGC CGGAGGCCAT CGCCGATCCG CAGCGCCAGG CGGCGATCGC CGACTATTTC GCGCGCCTGC AGAAGGCCTT CGCCTGGCGC CAGGCCAACC ACGAACGCTA CGCCGCGGCG CTCGCCGCGG AGATCGGCGT GCCGATCGAG GCGGTGCTCA CCCTGCTGCG CAACGAGAGC CAGGTGCGCC GCCTGGTAGC GGTGGACGAT GAGGCGATCC GCAGCCAGCA GGACGTGGCC GATACCTTCC ACAAGGCCGG GGTGATCGAG CGGTCGGTGG ACGTGCGTCC GCTGTGGGAC CGCAGTTTCG CGACCGCGTT CGCCGGCTGA
|
Protein sequence | MSQPSDGLSR RRFLAGSAGA LALSPLLLHG RRAGAATPGR LRVAQYKGGD KLLLEAAGLA DTPYPIDWAE FASGNLMVEA MNGGSLDLAY GSEIPPLFGY LKGARIRVVG VIKGDVNEQT VLVPKDSPIR SIADLKGKRV GYVRATTTQY YLTKMLDEVG LSFADIQAIN LTVPDGAAAF RTGQLDAWAI YGYSVPLAQT SVGARVLKRA NGYLSGNYLF FAAPEAIADP QRQAAIADYF ARLQKAFAWR QANHERYAAA LAAEIGVPIE AVLTLLRNES QVRRLVAVDD EAIRSQQDVA DTFHKAGVIE RSVDVRPLWD RSFATAFAG
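The protein sequence should follow from the gene sequence under translation table 11, and the GC content from a base count. The sketch below shows one way to check both in plain Python. The helper names `translate` and `gc_percent` are illustrative, the codon table is built from the commonly published amino-acid string for table 11 (whose amino-acid assignments match the standard code), and only the first 30 bases of the gene sequence are used as an excerpt, so the GC figure it gives is not the whole-gene value listed in the record.

```python
from itertools import product

# Translation table 11 amino-acid assignments, with codons enumerated
# in TCAG order at each position (third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def _clean(dna: str) -> str:
    """Remove the 10-base spacing used in the record and upper-case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = _clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = _clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above (an excerpt, not the full CDS).
excerpt = "ATGTCGCAAC CTTCGGACGG CCTGTCGCGC"
print(translate(excerpt))  # MSQPSDGLSR
print(round(gc_percent(excerpt), 1))
```

The translated excerpt matches the first ten residues of the protein sequence. Passing the full gene sequence to the same two functions would allow the complete protein and the whole-gene GC content to be compared against the record. Note that this sketch does not handle alternative start codons, which table 11 permits; the gene here starts with ATG, so that case does not arise.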