Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_43960 |
Symbol | |
ID | 7763269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4446152 |
End bp | 4447219 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643807251 |
Product | periplasmic substrate binding protein |
Protein accession | YP_002801492 |
Protein GI | 226946419 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.400414 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATGA TCGCAAACCT GCCCGGCAGA CTCCTGATCC TCCTCTCGCT GGCGCTGCTG GCGGCCTGTG GCGCGGACCC CGAGCCGCCC CGCCCGTCCG CCGGCGAGGC GCCGGACCTG TCCGGGGTGG TGCTGCGGGT CGGCGCGCCG AACAAGATCG GCAACCGTCC CTACCTGCAG GTGGCCGGGC AACTGGAGAA CCTGCCCTAC CGCATCGAAT GGTCGGAATT CTCCGCCACT CCGGCGCTCC TGGAGGCGCT GCGCGGCGGC AACATCGACA TCGGCGGCAA CGGCGGCTCC ACCGGCATCC TCTTCGAGGT CGCCAACAAC ACCGACAGCG GCATCAAGGT GATCGCCGCC GGACGCGCGG TCTCGACGGG CGCCGGCGCC GCCATCCTGG TGCGCGGCGA CTCGCCCTAC AAAGGCATCG CCGACCTCGG GGGCACCCGC TTTTCCGTGG TCAAGGGCAC CGGCAGCCAG TACCTGCTGG GCCAGGCGCT GAAAAAGGAA AACCTCGGCC TGGACGACCT GCAACTGCTC CATTTGACCA ACGACGCCGC CCTGGCCGCC CTGCTCGCCG GACACATCGA CGCCTGGGCC ACCTGGGACC CCCAGGCCAG CGTGCTGCAG TCGCACCCCG ACCTGCGCCT GCTGGGCTGG ATCGGCAACC CGGACGACTC CTGGACCATC CAGTACGCCT CGCAACAAGC CCTCGACGAT CCCGGCAAAC GCGCGGCGAT CGCCGACTTC CTGGGGCGCC TGGCCTATTC CACGGTCTGG GTCGGCCGCC ATCCCGAGCA ATGGGCCGAG GTCAGCGGGC AACTGACCCG GATCGATCCG GCCGTGATGC TCGGGATCGC CCGCAAGACG CGTACCGAAT ACGGCCTGGA CGGGGAACAG CGGACGGCGC TGGAAGCCTC CTTCCGGCGC GAGGCCGATT TCTGGCGGAG CATCGACGTG ATCGGCGCGG CCCCGGAGAT TCCCCGGCTG TTCGACCACC GCTTCAACGA ACGGATGCTC GAAGCCACCC GCCGGGCCCG ACAGACACTC GGCGAAACCG CGCCGTGA
|
Protein sequence | MSMIANLPGR LLILLSLALL AACGADPEPP RPSAGEAPDL SGVVLRVGAP NKIGNRPYLQ VAGQLENLPY RIEWSEFSAT PALLEALRGG NIDIGGNGGS TGILFEVANN TDSGIKVIAA GRAVSTGAGA AILVRGDSPY KGIADLGGTR FSVVKGTGSQ YLLGQALKKE NLGLDDLQLL HLTNDAALAA LLAGHIDAWA TWDPQASVLQ SHPDLRLLGW IGNPDDSWTI QYASQQALDD PGKRAAIADF LGRLAYSTVW VGRHPEQWAE VSGQLTRIDP AVMLGIARKT RTEYGLDGEQ RTALEASFRR EADFWRSIDV IGAAPEIPRL FDHRFNERML EATRRARQTL GETAP
|
| |