Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_29650 |
Symbol | |
ID | 7761866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3063603 |
End bp | 3065444 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643805838 |
Product | oligopeptide ABC transporter, periplasmic substrate binding protein |
Protein accession | YP_002800106 |
Protein GI | 226945033 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000703906 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCATT CCCTGCGCAC CCTGCTCCAC GGCGCCGGCC TGTTGTTGCT CGGTTTCGCC GGCCTGGTCC AGGCAGACCC GCAGCACGCC ATCACCCTGT ACGACGAGCC GCCCAAGTAC CCGGCCGACT ACCGGCACTT CGAGTACGTC AATCCGGACG CACCGAAGGG AGGCACGCTG CGCCTGGCCG ACTACGGTGG CTTCGACAGC CTCAACCCCT TCATTCCCAA GGGCAACGTG GAACGCCGCA TCGGCATGGT CTACGACAGC CTGACCTATC ATGCGCAGGA CGAGCCCTTC ACCGAATACG GCCTCATCGC CGAGAAGATC GAGAAGGCCC CGGACAACGG CTTCGTGCGC TTCTACATCA ATCCCAAGGC GCGCTTCCAC GACGGCCGGC CGATCACCGC CGAGGATGTG AAATTCACCT TCGAGACCCT GATCGAGCAC GGCGACCCGA TGTACCGCCA CTACTACGCG GACGTCGCCC AGGTGGTGGT CGAGGAGCCG CTGAAGGTAC GCTTCGACTT CAAGCACCGC GACAACCGCG AGCTGCCGCT GATCCTCGGC CAGTTGCAGA TCCTGCCCAA GCACTGGTGG GAAAGCCGCG ACTTCGCCAA GACCAGCCTG GAAGCGCCGC TCGGCAGCGG GCCGTACCGT GTCGCCAAGC TGGAGTCCGG TCGCTCCATC CGCTACGAGC GGGTCGCGGA CTGGTGGGCC AAGGACCTGC CGGTCTCCCG CGGCCAGTAC AACTTCGACG CCATCGTCGT CGACTACTAC CGCGACATGT CGGTCGCCCT GGAAGCCTTC AAGGGCGGGC AGTTCGACCT CAACCTCGAA TACTCCGCCA AGGATTGGGC CACCGGCTAC GAATCCGCCG CCCTCAATGA CGGCCGGATG ATCAAGAAGG CCGTCCCCAA CCACAACCCG GTCGGCATGC AGGCCTTCGC CTTCAACATC CGCCGGCCCC TCTTCCAGGA CCGCCGCGTG CGCGAGGCGC TCGGCCTGCT GTTCGACTTC GAATGGTCGA ACAAGCAACT GTTCTTCAGT TCCTACAAGC GCACCAGCAG CTACTTCGAG AACTCGGAAA TGGCCGCCCA CCAGTTGCCC GACAAGGAAG AGCTGAAGAT TCTCGAACCC TTGCGCGAAC AGTTGCCGCC GGAGGTGTTC AGCGAGGTCT ACCGGCCGCC GGTGACCAAC GGCGACGGCA TCATCCGCGA CCAGAAGCGC CGTGCCTACC AGTTGCTCCA GGAAGCCGGC TACCGCATCG AGAACGACCG CATGGTCGGC CCCGACGGCA AGCCGCTGGC CTTCGAATTC ATGCTCCACC AGACCAACCT GGAACGCATC CTGCTGCCCT ACAAGCGCAA CCTCGGCGAA CTCGGCATCG ACATGCAGAT CCGCCGCGTC GACGTTCCCC AGTACATCAA CCGCATGCGC AACCGCGACT TCGACATGAC CAGCGCCACC TGGCCGCAGT CCAACTCGCC GGGCAACGAG CAGCGCGAGT TCTGGCACTC CAGCAGCGCC GACAACCCCG GCAGCCGCAA CTTCATCGGC CTGCGCGACC CGGCCGTCGA CCGGCTGGTC GACGGGCTGA TCCGCGCCGA CTCGCGGAAA GGCCTGGTCG CCCACGCCCG CGCCCTCGAC CGTGCCCTGC AGTGGGGCTT CTACGTGGTG CCCAACTACC ATGTGAACAC TTGGCGCATC GCCTACTGGA ACAGGTTCGG CCAGCCGCAG AAGACGCCGC TGTACGACTA CGGCCTGATG ACCTGGTGGC AGGACAGCGA CAAGCCGCAG CCCCGGGACG AAGCGGTGGC GCACAAGGAG GAAGGTCGAT AG
|
Protein sequence | MTHSLRTLLH GAGLLLLGFA GLVQADPQHA ITLYDEPPKY PADYRHFEYV NPDAPKGGTL RLADYGGFDS LNPFIPKGNV ERRIGMVYDS LTYHAQDEPF TEYGLIAEKI EKAPDNGFVR FYINPKARFH DGRPITAEDV KFTFETLIEH GDPMYRHYYA DVAQVVVEEP LKVRFDFKHR DNRELPLILG QLQILPKHWW ESRDFAKTSL EAPLGSGPYR VAKLESGRSI RYERVADWWA KDLPVSRGQY NFDAIVVDYY RDMSVALEAF KGGQFDLNLE YSAKDWATGY ESAALNDGRM IKKAVPNHNP VGMQAFAFNI RRPLFQDRRV REALGLLFDF EWSNKQLFFS SYKRTSSYFE NSEMAAHQLP DKEELKILEP LREQLPPEVF SEVYRPPVTN GDGIIRDQKR RAYQLLQEAG YRIENDRMVG PDGKPLAFEF MLHQTNLERI LLPYKRNLGE LGIDMQIRRV DVPQYINRMR NRDFDMTSAT WPQSNSPGNE QREFWHSSSA DNPGSRNFIG LRDPAVDRLV DGLIRADSRK GLVAHARALD RALQWGFYVV PNYHVNTWRI AYWNRFGQPQ KTPLYDYGLM TWWQDSDKPQ PRDEAVAHKE EGR
|
| |