Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_12520 |
Symbol | |
ID | 7760195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1218510 |
End bp | 1220123 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643804155 |
Product | dipeptide ABC transporter, periplasmic substrate-binding component |
Protein accession | YP_002798454 |
Protein GI | 226943381 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0296833 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCTCC GACGCCCGAC CGCCCTCCTG CTCGCCGGCC TGCTCGCCGG CGCCGTCCAG GCCGCGCCGA GCAGCCTGGT GGTCTGTACC GAGGGCAGCC CGGAAGGCTT CGACATCGTC CAGTACACCG CCGCCGTCAG CGCCGACGCC TCGGCGGAGA CGGTGTTCGA CCGCCTGCTG CGCTTCGCTC CCGGCGGCAG CGAACTGCTC CCCGCCCTCG CCGAGCGCTG GGAGGTGTCC GCCGACGGCC TCGAATACAC TTTCCACCTG CGCCGCGGCG TGAAATTCCA GCGCACGCCC TGGTTCATGC CGAGCCGCGA ATTCAACGCC GACGACGTGA TCTGGAGCTT CCGGCGGCAG ATCGATCCCG CGCATCCCTG GCACAAACTG TCGCCGCGCG GTTTCCCCTA TGCCGAATCG ATGGCCGTGG GCGAGCTGAT AGAACGCATC GAGCGCCTCG ATGAGCACCG CGTGCGCTTC GTCCTGCGCC ACCCGGAGGC GCCCTTCCTG GCCAACCTGG CGATGGGCTT CGCCTCCATC TACCCGGCCG AATACGCCGA TCGACTGCTG GCCGCCGGCA CCCCGGAGCG CCTCAACAGC CAGCCGGCGG GCAGCGGCCC CTTCATCTTC GAACGCTACG AGAAGGACGC CCAGGTGCGC TTCCGGCGTA ACCCGGACTA CTGGGACGGC GCGCCGGCCA TCGAGCGGCT GATCTTCGCC ATCACCCCCG AGCCCAACGT GCGGGTGCAG AAGCTCAAGG CCGGCGACTG CCAGATCGCC CTCTACCCGC GCCCGGTGGA CCTGCCCGGC CTGCGCCGGG ACGTGCGCAT CCAGGTGCTG GAGGACGAAC CGCTGCTGAC CGCCTACATC GGCATCAACA CCCGTCATCC GCCGCTCGAC GACGTGCGGG TGCGCCAGGC GCTCAACCTC GCCTTCGACA AGTCCGCCTA CCTGCGCGCC CAGTACGGCG AGGGCGGCGC CAGCCCGGCG GTGGCACCCT ACCCGCCGAG CCTGTGGGGC TCCGACCCGA CGCTCGCCGG CTGGCCCCAC GACCCGGCGC GAGCCCGCGC CCTGCTGGCC GAAGCCGGAC ACGCCCAAGG CCTGAAGCTG AGTATCTGGA CCCGTCCCGG CGGCGGCCCG ACCAACCCCA ACCCCGGCAT CGGCGCCCAG TTGCTGCAGG CCGACCTGGC CGCCATCGGC ATCCGGGCGG AAATCCGCGT GCTGGAATGG GGCGAGCTGA TCAAGCGGGC GAAGAACGGC GAGCACGACC TGGTGTTCAT GGGCTGGGCC GGCGATAACG GCGACCCGGA CAACTTCCTG ACCCCCAATC TGTCCTGCGC CGCGGCCGCC TCGGGCGAAA ACCAGGCCGG CTGGTGCGAC GAGCGCTTCG ACGCCCTGTT GCGCGAGGCG CGCCGCACCA CCGACCAGGC GCAACGCACC GCGCTCTACC GCCAGGCCCT GGCGATCTTC CACGAACAGG CGCCCTGGAT TCCCCTGGCC CATCCCCGCG AGTTCGCCGC CGTGCGCCGC GACGTCGAGG GCTTCGTGAT CAGCCCGCTG GGCACCAACA ACTTCGCCGG CGTGCGTCGC GCCCCGGCCA ACGCTCCGGA GTGA
|
Protein sequence | MMLRRPTALL LAGLLAGAVQ AAPSSLVVCT EGSPEGFDIV QYTAAVSADA SAETVFDRLL RFAPGGSELL PALAERWEVS ADGLEYTFHL RRGVKFQRTP WFMPSREFNA DDVIWSFRRQ IDPAHPWHKL SPRGFPYAES MAVGELIERI ERLDEHRVRF VLRHPEAPFL ANLAMGFASI YPAEYADRLL AAGTPERLNS QPAGSGPFIF ERYEKDAQVR FRRNPDYWDG APAIERLIFA ITPEPNVRVQ KLKAGDCQIA LYPRPVDLPG LRRDVRIQVL EDEPLLTAYI GINTRHPPLD DVRVRQALNL AFDKSAYLRA QYGEGGASPA VAPYPPSLWG SDPTLAGWPH DPARARALLA EAGHAQGLKL SIWTRPGGGP TNPNPGIGAQ LLQADLAAIG IRAEIRVLEW GELIKRAKNG EHDLVFMGWA GDNGDPDNFL TPNLSCAAAA SGENQAGWCD ERFDALLREA RRTTDQAQRT ALYRQALAIF HEQAPWIPLA HPREFAAVRR DVEGFVISPL GTNNFAGVRR APANAPE
|
| |