Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_18670 |
Symbol | |
ID | 7760801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1848053 |
End bp | 1849816 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643804765 |
Product | extracellular solute-binding protein |
Protein accession | YP_002799054 |
Protein GI | 226943981 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATT ATCCGAACGA CAGCTTTTCC CTTTCCTCCC TGGTCGGTCA CTCGCCCGGC AAGTCCAGTC GCCCCGAAGC GCCGGCGGCA TTTGTCCCCG GCCAGTCGAA CGGCACCGCG CCGGGCGCAC TGCGCAGTCT GGCGCGCCGC CTGGCCGGCC TGGTGATTCC CGTGGCCGCC ACGCTCGCCC TGGCCGGCTG CTCGCCGTCG GCCGAGGACG GCCAGGCCGC CAGGACCCTG AAGATCGCCT TCTGGGGCGA CAACACCGTG CTGGTCAGCG TCGATCCCTT CCAGGTCTAC TGGATCGAGC ATCGCGTGCT GTTGCGCAAC GTCGCCGAAT CGCTGACCGA CCAGGACCCG AAGACCGGCG AGATCATTCC TTGGCTGGCG AAAAGCTGGG AAGTGAGCGA CGACGCCCTG GAGTACACCT TCCACCTGCG CGAGGACGTC ACCTTCAGTA ACGGCGAGCG TTTCGACGCC CAGGCGGTGA AGATCGCCTT CGACAGCAAC AAGGCGTTCG CCGCCGAGGT GCCGTCGACT TTCGGCGCCA CCTACCTGGC CGGCTACGAG CATGCCGAGG TGCTCGACGC TTTCACCGTC AAGCTGGTGC TGTCGCGGCC CAATGCCGGT TTCCTGCAGG CCGCCTCCAC CACCAACCTG GCGATCCTCG CGCCCGCTTC CTACCGACTG ACGGCCAGGG AGCGTTCCCT CGGCAAGATC GTCGGCAGCG GCCCCTTCGT CCTGGAAAGC TACACCCCGG AAGTCGGCGC CAGACTGGTC AAGCGCAAGG ACTACGCCTG GCCTTCGGCG AACCTGAAGA ACCCCGGCGC GGCGCACCTG GACAGCGTCG AACTCAGCTA CGTGCCGGAG GAAAGCGTGC GCAACGGCCT GTTCCTGCAG GGGCAGGTCG ACATCCTCTG GCCGCGCAAC CCTTTCTCCG AGGTGGACCT GAAGCTGTTC CAGTCCAGGG GCGCCACCAT CCAGAGCCGT TCGCTGCCGG GGCCGGCCTT CAACCTTTAT CCGAACGCCC AGGACAAGCG TGTCCTGGCC GACCCCAGGG TACGCCTGGC GCTGCAGAAG GCGATCGACC GCAAGACCTA CGCCGCCACC ATCTACAACC CGGATTTTCC AGTGGTGGAC GGGGTGTACG ACCTGACCAC GCCTTATTTC AAGACCCAGG GCGCCAAGCT GGCCTACGAC CCGGCCGGCG CGGAGCGCCT GCTCGACGAG GCCGGCTGGG TCAAGGGCGC CGACGGCTAC CGGCAGAAGG ACGGCAAGCG CCTGAGCCTG ACCTACATCC TGTCGCCCGC CGAAACGGCC GGCGACGTGC TGGTTCAGGA TCAACTGCGC AAGGTCGGCA TCGAGCTGAA GCTCGACGTG CTCACCCGCG CCGAGCGGGT CACGGCCAAC GCCGCGGGCA ACTACGACCT GACCTCCAGC TACATGAGCC GTGCCGATCC GATCATCCTG CAGACCATTC TCGATCCGCG CACGGCCAAC AGCGCCGCCC TGGCCAGCAA CATCTATTCC CCGCAGACCC TGGAGCGCGC CACGGCGCTG TTCGACGCCG GCATCACCGC GACCGCCGGC GGGCAGCGCG CCCGCGCCTA TGGCGAACTG CAGGACCTGC TGATCGACGA GGGCCTGGCC TTCCCGATCT ACGAGCGCGT CTGGCAGGCC GCCACCGCGC CGCGCGTGCG CAACTTCCAG TGGTCCGCCG AGGGCTTCGC CTTCCTCAGC GACATCGAGG TGGACCAGCC ATGA
|
Protein sequence | MSDYPNDSFS LSSLVGHSPG KSSRPEAPAA FVPGQSNGTA PGALRSLARR LAGLVIPVAA TLALAGCSPS AEDGQAARTL KIAFWGDNTV LVSVDPFQVY WIEHRVLLRN VAESLTDQDP KTGEIIPWLA KSWEVSDDAL EYTFHLREDV TFSNGERFDA QAVKIAFDSN KAFAAEVPST FGATYLAGYE HAEVLDAFTV KLVLSRPNAG FLQAASTTNL AILAPASYRL TARERSLGKI VGSGPFVLES YTPEVGARLV KRKDYAWPSA NLKNPGAAHL DSVELSYVPE ESVRNGLFLQ GQVDILWPRN PFSEVDLKLF QSRGATIQSR SLPGPAFNLY PNAQDKRVLA DPRVRLALQK AIDRKTYAAT IYNPDFPVVD GVYDLTTPYF KTQGAKLAYD PAGAERLLDE AGWVKGADGY RQKDGKRLSL TYILSPAETA GDVLVQDQLR KVGIELKLDV LTRAERVTAN AAGNYDLTSS YMSRADPIIL QTILDPRTAN SAALASNIYS PQTLERATAL FDAGITATAG GQRARAYGEL QDLLIDEGLA FPIYERVWQA ATAPRVRNFQ WSAEGFAFLS DIEVDQP
|
| |