Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_36280 |
Symbol | |
ID | 7762523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3691703 |
End bp | 3694624 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643806495 |
Product | lipoprotein |
Protein accession | YP_002800750 |
Protein GI | 226945677 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3451] Type IV secretory pathway, VirB4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCCTCT TCGAACGATT CCGCGGCCGC GCCCGGATGG CGGAGAAAGT CGAGCCCGAG GTCCAGATCG CCGACGTCGC TCAAGAGAGT CATGAGGCTC ATGAGGCTCA TGAGGCTCCC GACATCGAGA GGATCGAAGC CGCCACCGAG CGCCATCTCG AACGGCTGAC CCGCCTCGGC ATCCCCGATC CGACGGACTG GCGAGATCCC GGCAAACGCC CGGCGACCCA GGCGGACGTC GCCAGGCTCT ACGAGGTGGC GCCCTCCTTC GTCGACCTGT TGCCCTGGGT GGAATACCTG CCGGACGAGC AGGCGATGCT CCTCGAGGAC GGCCATTCGT GGGCAGCCTT CTTCGAACTG ACGCCGATCG GCACCGAAGG GCGCGACCCG GCCTGGCTGC GGATGGTTCG GGACGCCCTG GAAAACGCCC TGCAGGACAG TTTCGATGAA CTCGACAGCT CGCCGTGGGT GGTACAGATG TATGCCCAGG ACGAAACCAA CTTCGACGAT TACCTGCAGT CCCTGCGCAA CTACATCCAG CCCAGGGCCG CGGGGAGCGA CTTCACCGAG GTCTACCTGG CGTTGTTCAA GCGGCATCTG GAGGCGATCG CCAAGCCCGG CGGATTGTTC GAAGACACGG CCGTCAGCCA GTTGCCCTGG CGCGGCCAGC AGCGCCGGGT CCGCCTGGTG GTCTACCGCC GCGTCCAGGT ATCGGACATG GTCGTGCGTG GCCAGGCGCC CGCCCCCTAT CTCGCGGTGA TCTGCGATCG CCTGGTCGGT GCCCTGGCCA ACGCCGGCGT GGTCGCCAGA CGCATGGATG GCCAGGCCGT GCGCACCTGG CTCGTCCACT GGTTCAATCC CCGCCCCGAC CACCTGGGCG CGACGGATGC CGACATCCGC CGCTTCCTCG AACTGGTCTG CCAAGCCCCG GAGGGACTCT CCGAAGACGA CTTGCCGCTG GCCAGCGGCA CCGACTTCGC CCAGAACCTC TTCTACCGGG AACCGCAGTC CAGCACGCCC AAAGGGCTCT GGTATTTCGA TGCGATGCCG CATCGCGTGG TGGTCGTCGA CCGGCTGCGC GATGCGCCGA AGACCGGCCA CCTGACCGGC GAGACCCGCA AGAGCGATGC CTACAACGCC CTGTTCGACC GCCTGCCGGA AGACACCGTG CTCTGCCTCA CGATGGTCGC CACGCCCCAG GACCTGCTGG AAGGCCACCT GGAACAGCTC TCGAAGAAAG CGGTGGGCGA TACGCAGGCC TCGATCCACA CGCGCGACGA TGTGGAGCAC GTCCGCACAC TGCTCGGCCG CAAGCACAAG CTCTATCGGG GGAACCTCGC CTTCTATCTG CGGGGGACAG ACCACAGGCA GCTCGACGAA CGGACCACCC AGCTCAGCAA CGCCCTGCTC GGTGCCGGCA TGTCGCCGGT CCAGCCGCAG GACGAGGTCG CCCCGCTGAA CAGTTACCTG CGCTGGCTGC CCTGCAACTT CGACCCCAAC GAGCGGCATG CCCTGGACTG GTACGTCCAG TTCATGTTCG TCCAGCACAT CGCGAATCTG GCACCGATCT GGGGACGCTC GACGGGCACC GGCCACCCCG GTGTCACCCT GTTCAACCGC GGTGGCGCGC CGGTCACCTT CGACCCGCTG AACCGCCTCG ATCGCCAGAT GAATGCCCAC CTGTTCATCT TCGGTCCGAC CGGCTCGGGC AAATCGGCTT CGGCGACCAA CATCCTCAGC CAGGTCATCG CGATCTACCG GCCTCGCCTG TTCATCGTCG AGGCGGGCAA CAGCTTCGGC CTGCTGGGCG AGTTCGCGAA AAACCTGGGG CTCAGCGTGA ACCGCGCGCG TCTCGCTCCC GGTTCGGGCG TCAACCTGGC GCCCTTCGCC GATGCGATCA AGCTCATCGA AGCCCCCGGC AAGACGAAGG TCCTGGACAT GGAGGCCTCC GACGATCACC TGGCCGACAC CGAGGACGAG CAGCGGGACA TCCTCGGCGA GATGGAAATC ACCGCCCGCC TGATGGTCAC CGGCGGGGAG GCGAAAGAGG ACGCGCGACT GACGCGCGCC GACCGCAGCG CCCTGCGCCA GTGCATCCTG GACGCGGCCA AGACCTGCTC CGAGCAGGGA AAGACCGTCC TCCCGGAGGA TGTGCGCGAT GCTCTGCGCC GCATGGCGGC CGACGAGAAA ATACTTGAAC CCCGGCGCAA TCGCCTGATG GAAATGGCCG AGGCCATGTC CATGTTCTGC ATGGGCGCCG AAGGCGAAAT GTTCAACCGG CCGGGCAGCC CCTGGCCCGA GGCGGACCTC ACCATCGTCG ACCTGGCGAC CTACGCCCGC GAAGGCTACG AAGCGCAGAT GGCCATCGCC TACATCTCGC TGCTCAACAC GGTGAACAAC ATCGCCGAGC GCGACCAGTT CAAGGGGCGG CCGCTGATCT TCTTCACCGA CGAAGGCCAC ATCCAGCTCA AGGTGCCGCT GCTCTCTCCC TACGCGGTCA AGATCACCAA GATGTGGCGC AAGCTCGGCG CCTGGTTCTG GATGGCCACC CAGAACGTGG ACGACGTCCC CCCGGAGGCC TCCGCCCTGC TGAACATGAT CGAGTGGTGG ATCTGCCTGA ACATGCCGCC GGACGAGGTG GAGAAGATCG CCAGGTTCCG CGAACTGACG CCGGCGCAGA AGGCGATGAT GCTCTCGGCC CGCAAGGAGA ACGGCAAGTT CACCGAAGGC GTCGTGCTCG CCAAGCGCCT CGAACTGCTG TTCCGGGTGG TTCCGCCCAG CCTGTACCTG GCGCTCGCGA TGACCGAGCC GGAAGAAAAA AAGCAGCGCT ACGACATCAT GATGGCCAAG GGCTGCGACG AACTCGGCGC CGCCCTGGAA GTCGCCGCCG ACCTGGACCG CAAACGCGGC ATCACCTCCT GA
|
Protein sequence | MGLFERFRGR ARMAEKVEPE VQIADVAQES HEAHEAHEAP DIERIEAATE RHLERLTRLG IPDPTDWRDP GKRPATQADV ARLYEVAPSF VDLLPWVEYL PDEQAMLLED GHSWAAFFEL TPIGTEGRDP AWLRMVRDAL ENALQDSFDE LDSSPWVVQM YAQDETNFDD YLQSLRNYIQ PRAAGSDFTE VYLALFKRHL EAIAKPGGLF EDTAVSQLPW RGQQRRVRLV VYRRVQVSDM VVRGQAPAPY LAVICDRLVG ALANAGVVAR RMDGQAVRTW LVHWFNPRPD HLGATDADIR RFLELVCQAP EGLSEDDLPL ASGTDFAQNL FYREPQSSTP KGLWYFDAMP HRVVVVDRLR DAPKTGHLTG ETRKSDAYNA LFDRLPEDTV LCLTMVATPQ DLLEGHLEQL SKKAVGDTQA SIHTRDDVEH VRTLLGRKHK LYRGNLAFYL RGTDHRQLDE RTTQLSNALL GAGMSPVQPQ DEVAPLNSYL RWLPCNFDPN ERHALDWYVQ FMFVQHIANL APIWGRSTGT GHPGVTLFNR GGAPVTFDPL NRLDRQMNAH LFIFGPTGSG KSASATNILS QVIAIYRPRL FIVEAGNSFG LLGEFAKNLG LSVNRARLAP GSGVNLAPFA DAIKLIEAPG KTKVLDMEAS DDHLADTEDE QRDILGEMEI TARLMVTGGE AKEDARLTRA DRSALRQCIL DAAKTCSEQG KTVLPEDVRD ALRRMAADEK ILEPRRNRLM EMAEAMSMFC MGAEGEMFNR PGSPWPEADL TIVDLATYAR EGYEAQMAIA YISLLNTVNN IAERDQFKGR PLIFFTDEGH IQLKVPLLSP YAVKITKMWR KLGAWFWMAT QNVDDVPPEA SALLNMIEWW ICLNMPPDEV EKIARFRELT PAQKAMMLSA RKENGKFTEG VVLAKRLELL FRVVPPSLYL ALAMTEPEEK KQRYDIMMAK GCDELGAALE VAADLDRKRG ITS
|
| |