Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_21640 |
Symbol | |
ID | 7761084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 2164935 |
End bp | 2165921 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643805052 |
Product | nitrate/sulfonate/bicarbonate ABC transporter, substrate binding protein family 3 |
Protein accession | YP_002799333 |
Protein GI | 226944260 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.800377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGATG AATATCCTTC GATGAATCGC CGTCGCATCC TGTTGACGGG AAGCCTGCTG GCGGGGGCGG CGCTGGCGAG CCGCTGGGGC GTGGCCGCGA GCGGCCCGCT GGATCTGTCC CGCACCCGGT TGGGGGTCGG TACCTACAAG GGCGCCGCGC CTTCCTTTTT CGAGGAAGCC GGTGTGGCGC CGCCTCCCTA TGAAGTCAGC TACGCGGAAC TGGCCGGCGG CAATCTGATC GTCGAGGCAC TGGCCGCCGG CAGCCTGGAT GTCGGCAGCA TGAGCGAAAT CCCGCCAGTG TTCGCCATCC GGAGCCGGGC GCCCATCAAG CTGATCGCCG TGCTGCGTGG CGACGTGAAC AATCAGGTCT TCCTGGTGCC GCGGGGCTCC GGCATCCGGG AGATCGCCGA ACTGAAGGGC AAGCGCGTCG GCTTCGTGCG TTCGACGACG TCCCACTACT TCCTGATCAA GGCCTTGAAG GAGCAGGGCC TGAGCCTGGC CGACATCGAG CCGGTGGGGC TGACGCCGCA GGACGGCTTT TCCGCCTTCC AGAGCGGACA ACTGGATGCC TGGGTGATTT ATGGCATCCA TATCCAGATC GCGCTCGCCA GGACCGGCGC GCGCATCATC AAGACCGCGC TGGGTTATCT GTCGGGCAAT TACGTGATCG CCGCCCGCGC CGAATCCCTG AAGGACCCCT ACCGGGTCGC GGCGATCAAG GATTATCTGG CGCGGGAACA GCAGGTATGG GACTGGGTGC AGGCCAACCC GGAGCGCTGG GCGAAGAAAA GCGCGGCGAT CACCGGCATC GACGCTTCCC TGTTCATGGA TCAGTTCCGG GCACACAGCG AGCCTTATCG CATGGTTCCC GTCGACGACG CGGCGATCGC TTCCCAGCAG GAGGTGGCGG ATCTGTTCCA CGAGGCCGGC GTACTGGACC AGCGGCTGGA TGTCGCCCCC CTGTGGACCA GGGACGTCTG GCCCTAA
|
Protein sequence | MPDEYPSMNR RRILLTGSLL AGAALASRWG VAASGPLDLS RTRLGVGTYK GAAPSFFEEA GVAPPPYEVS YAELAGGNLI VEALAAGSLD VGSMSEIPPV FAIRSRAPIK LIAVLRGDVN NQVFLVPRGS GIREIAELKG KRVGFVRSTT SHYFLIKALK EQGLSLADIE PVGLTPQDGF SAFQSGQLDA WVIYGIHIQI ALARTGARII KTALGYLSGN YVIAARAESL KDPYRVAAIK DYLAREQQVW DWVQANPERW AKKSAAITGI DASLFMDQFR AHSEPYRMVP VDDAAIASQQ EVADLFHEAG VLDQRLDVAP LWTRDVWP
|
| |