Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_48060 |
Symbol | |
ID | 7763668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4869857 |
End bp | 4871434 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643807650 |
Product | extracellular solute-binding protein |
Protein accession | YP_002801885 |
Protein GI | 226946812 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCTCG CCCTTCTCCC CCTGCTGTTC GCCCCGTTGC TGGCCCAGGG CGCCAGCCTC GTCGTCTGCA CCGAGGCCAG CCCCGAGGGG TTCGATGTGG TGCGCTACAA CTCGTTGACC ACCACCAATG CCTCCGCCGA CGTGCTGATG GACCGCCTGG TCGAATTCGA TCCGCAGAGC GGCCAGTTGC AGCCCGGCCT GGCAAGGAGC TGGCAAGTCT CCACGGATGG CCTGGTCTAC GACTTCCGGC TGCGCGAGGG CGTGGCCTTC CATCACACGC CCTGGTTCAG CCCGAGCCGG ACCTTGAATG CCGAGGACGT GCTGTTCAGC TTCCAGCGCA TGCTCGACCC CGGCCACCCC TGGCACCGAA GCGCACCGAG CGGCTATCCG CACGCCCAGT CGATGCAGCT CGGCGAACTG ATCCGCAAGG TCGAAGCCGT CGATCCGCTG CACGTGCGCT TCACCCTCGC CCGTCCCGAC GCCACTTTCC TCGCCACCCT GAGCATGGGC TTCGCCTCCA TCTACTCGGC CGAGTACGCC GCCCGGCTGC TGGCCGCCGG TACGCCGGAG AAACTCGACA GCCAGCCGGT CGGCACCGGT CCCTTCGTTC TCAAGCGTTA TCAGAAGGAC GCGCTGGTGC GCTATCTGGC CAACCCCGAC TACTTCGCCG GAAAGCCCGC GGTCGACGGG CTGGTACTGG CCATTACGCC GGATGCAAAT GTGCGCCTGC AGCGCCTGCG CCGCGGCGAA TGCCAGATCG CCCTGTCGCC CAAACCGCAG GACGTGCGCG CGGCGCAGGA CGATCCGAGT CTCGCGGTGG CCGCCACGCC GGCTTTCATG ACCGCCTTCG TCGCCCTCAA CAGCCAGCAC CCGCCACTGG ACAGACCCGC GGTGCGCCAG GCGATCAACC TCGCCTTCGA CAGGACCAGC TACCTGAAAG CGGTGTTCGA AAACAGCGCG CAGCCGGCCG AAGGGCCTTA TCCGCCTACC ACCCGGAGCC ATGCCACCGA CCTGCCGGGC TATCCCCACG ACCCGGCCAA GGCCCGCGAG TTGCTGGCCG GCGTCGGGCT GGCGGAGGGC TTCAAGACCA CTATCTGGAC ACGCCCGGGC GGCAGCCTGC TCAACCCCAA TCCCACGCTC GGCGCCCAGT TGCTGCAGGC CGACCTCGCC CAGGTCGGCA TCCAGGCGGA GATCCGGGTG ATCGAGTGGG GCGAGTTGAT ACGCCGCGCC AAGGCTGGCG AACACGATCT GCTGTTCATG GGCTGGGCCG GCGACAACGG CGATCCGGAC AACTTTCTCA CCCCGCAGTT CGCCTGCGCC TCGGTCGAGT CGGGACTCAA CTTCGCTCGC TACTGCGACC CGAAACTGGA CCGACTGATC GCCGAAGGCA AGCGCAGCAG CGACCAGGCG GAACGCACCC GGCTATACGA GGAAGCGCAG AAGCTGATCC ACGAACAGGC GCTCTGGGTG CCGCTGGCCC ACCCCACCGC CGCCGTACTG CTGCGCAAGG GTGTCGAGGG CTACAGGGCC AACCCGTTCG GGCGCCAGGA TTTCGGCAAG GTACGACTGG ATCGCTGA
|
Protein sequence | MRLALLPLLF APLLAQGASL VVCTEASPEG FDVVRYNSLT TTNASADVLM DRLVEFDPQS GQLQPGLARS WQVSTDGLVY DFRLREGVAF HHTPWFSPSR TLNAEDVLFS FQRMLDPGHP WHRSAPSGYP HAQSMQLGEL IRKVEAVDPL HVRFTLARPD ATFLATLSMG FASIYSAEYA ARLLAAGTPE KLDSQPVGTG PFVLKRYQKD ALVRYLANPD YFAGKPAVDG LVLAITPDAN VRLQRLRRGE CQIALSPKPQ DVRAAQDDPS LAVAATPAFM TAFVALNSQH PPLDRPAVRQ AINLAFDRTS YLKAVFENSA QPAEGPYPPT TRSHATDLPG YPHDPAKARE LLAGVGLAEG FKTTIWTRPG GSLLNPNPTL GAQLLQADLA QVGIQAEIRV IEWGELIRRA KAGEHDLLFM GWAGDNGDPD NFLTPQFACA SVESGLNFAR YCDPKLDRLI AEGKRSSDQA ERTRLYEEAQ KLIHEQALWV PLAHPTAAVL LRKGVEGYRA NPFGRQDFGK VRLDR
|
| |