Gene Information |
Locus tag | Avin_28020 |
Symbol | |
ID | 7761707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2889430 |
End bp | 2891634 |
Gene Length | 2205 bp |
Protein Length | 734 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643805681 |
Product | hypothetical protein |
Protein accession | YP_002799949 |
Protein GI | 226944876 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain; [TIGR03549] conserved hypothetical protein TIGR03549 |
|
|
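The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Avin_28020.
length_bp = gene_length(2889430, 2891634)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the computed lengths agree with the listed Gene Length (2205 bp) and Protein Length (734 aa).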
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.458293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATTA AGGTTAATTT TCTCGATAAC CTTCGGCTAG AAGCCAGGTT CGATGATTTC ACGGTGATCG CCGACCAGCC GATCCGCTAC AAAGGCGATG GCTCCGCACC GGGTCCGTTC GATTACTTCC TGGCTTCATC GGCTTTGTGC GCGGCTTACT TCGTAAAGTT GTACTGCCAG ACGCGCGATA TCCCCACCGA TAATATCCGC CTGTCGCAGA ACAACATTGT CGATCCGGAG AATCGCTACA AGCAGATCCT CAAGATCCAG GTGGAGTTGC CGGCGGATAT CTCGGAGAAG GATCGCCTGG GCATCCTGCG TTCCATCGAC CGCTGCACGG TGAAGAAGGT CGTGCAGACC GGGCCTGACT TCGTGATCGA GGAAGTGGAA AACCTCGATG CCGATGCCCA GGCGTTGTTA ATGCTCAACC CCGACGCCGA CGCGAGCACC TACATCCTGG GCAAGGATCT GCCGCTGGAG CAGACCATCG CCAACATGTC GGAAATCCTC GCCGGCCTGG GCATGAAGAT CGAGATCGCG TCGTGGCGCA ACATAGTGCC CAACGTGTGG TCGCTGCATA TCCGCGATGC GCAATCGCCG ACGTGCTTCA CCAACGGCAA GGGTGCGACC AAGGAAAGCG CGCTGGCATC GGCTCTGGGC GAATTCATCG AGCGCCTGAA CTGCAATTTC TTCTACAACG ACCAGTTCTG GGGAGAAGAC ATCGCCAATG CGGCGTTCGT GCACTACCCG GATGAGCGCT GGTTCAAGCC TGGCCGCAAG GATGCGCTGC CGGCTGAAAT CCTCGACGCG TACTGCCTGG AAATCTACAA CCCCGACGAC GAGTTGCGTG GTTCGCACCT GTACGACACC AACTCCGGCA ATGTGGAGCG CGGCATCTGT TCGCTGCCGT TCGTGCGCCA GTCCGACGGC GAGGTGGTGT ACTTCCCGTC CAACCTGATC GAGAACCTGT ACCTCAGCAA TGGCATGAGT GCCGGCAATA TGCTGGCCGA AGCGCAGGTG CAGTGCCTGT CGGAAATCTT CGAGCGCGCG GTGAAGCGTG AAATCCTCGA AGGTGAACTC GCCCTGCCCG ATGTGCCGCA CGAGGTGCTG GCGAAGTACC CGGGCATCCT GGCCGGTATC CAGGGCCTGG AAGAACAGGG CTTCCCGGTG CTGGTCAAGG ATGCGTCGCT GGGCGGCGAG TTCCCGGTGA TGTGCGTGAC CTTGATGAAC CCGCGTACCG GCGGCGTGTT CGCCTCGTTC GGCGCGCACC CGAACTTCGA GGTGGCACTG GAGCGCAGCC TGACGGAGTT GCTGCAGGGC CGCAGCTTCG AAGGCCTGAA CGACCTGCCG GCGCCGACCT TCGAAAGCCA CGCCTTGACC GAGCCGAACA ACTTCGTCGA ACACTTCATC GACTCCAGCG GCGTGGTGTC GTGGCGCTTC TTCAGCGCCA AGGCCGACTT CGAATTCGTC GAGTGGGACT TCTCCGGCCA GGGGGAAGAT TCCAATGCCG AGGAAGCCGC CACCCTGTTC GGCATTCTCG AAGACATGGG CAAGGAAGTG TACATGGCGG TGTACGAGCA CCTGGGGGCC ACGGCGTGCC GCATCCTGGT GCCAGGGTAT TCGGAGATCT ATCCGGTAGA GGATCTGATC TGGGATAACA CCAACAAGGC GTTGTCGTTC CGCGAGGACA TCCTGAACCT GCACCGCCTG GACGATGCCC GCCTCAAGGC ACTGCTCAAG CGTCTGGAAA ACTGCGAGGT GGATGACTAC ACCGACATCA CCACCCTGAT CGGTATCGAG 
TTCGACGACA ACACGGTCTG GGGCCAGTTG ACCCTCCTCG AACTGAAACT GCTGATCAGC CTCGCCCTGC GTCGCTTCGA AGACGCGAAG GAACTGGTGG AAGCCTTCCT GCAGTACAAC GACAACACGG TCGAGCGGGG GCTGTTCTAC CAGACCCTGA ACGTAGTGCT GGAAGTGGTG CTGGACGAAG AGCTGGAGCT GGCCGACTAC GAGGTCAACT TCCGCCGGAT GTTCGGCAAC GAGCGGATGG ACGCGGCGCT GGGGTCGGTG GATGGCAGCG TGCGCTTCTA CGGTCTGACG CCGACCAGCA TGAAGCTGGA AGGGCTCGAC AGGCACCTGC GCCTGATCGA CAGCTACAAG AAGCTGCACG GGGCGCGGGC CAGAGTGGCG GCTTTATCCC AATAA
|
Protein sequence | MEIKVNFLDN LRLEARFDDF TVIADQPIRY KGDGSAPGPF DYFLASSALC AAYFVKLYCQ TRDIPTDNIR LSQNNIVDPE NRYKQILKIQ VELPADISEK DRLGILRSID RCTVKKVVQT GPDFVIEEVE NLDADAQALL MLNPDADAST YILGKDLPLE QTIANMSEIL AGLGMKIEIA SWRNIVPNVW SLHIRDAQSP TCFTNGKGAT KESALASALG EFIERLNCNF FYNDQFWGED IANAAFVHYP DERWFKPGRK DALPAEILDA YCLEIYNPDD ELRGSHLYDT NSGNVERGIC SLPFVRQSDG EVVYFPSNLI ENLYLSNGMS AGNMLAEAQV QCLSEIFERA VKREILEGEL ALPDVPHEVL AKYPGILAGI QGLEEQGFPV LVKDASLGGE FPVMCVTLMN PRTGGVFASF GAHPNFEVAL ERSLTELLQG RSFEGLNDLP APTFESHALT EPNNFVEHFI DSSGVVSWRF FSAKADFEFV EWDFSGQGED SNAEEAATLF GILEDMGKEV YMAVYEHLGA TACRILVPGY SEIYPVEDLI WDNTNKALSF REDILNLHRL DDARLKALLK RLENCEVDDY TDITTLIGIE FDDNTVWGQL TLLELKLLIS LALRRFEDAK ELVEAFLQYN DNTVERGLFY QTLNVVLEVV LDEELELADY EVNFRRMFGN ERMDAALGSV DGSVRFYGLT PTSMKLEGLD RHLRLIDSYK KLHGARARVA ALSQ
|
| |
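The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is already given 5' to 3' on the coding strand (it begins with ATG even though the Strand field is "-"), and it uses the amino acid assignments of NCBI translation table 11, which match the standard code. The function name `translate` and the short test fragments are chosen for illustration only. This is not the database's own pipeline.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored,
    and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two 10-base groups and the last 15 bases of the gene sequence above.
print(translate("ATGGAAATTA AGGTTAATTT"))
print(translate("GCTTTATCCC AATAA"))
```

The first fragment yields the start of the listed protein sequence and the second yields its final four residues before the TAA stop codon.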