Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_42570 |
Symbol | |
ID | 7763132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4289997 |
End bp | 4291598 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643807112 |
Product | hypothetical protein |
Protein accession | YP_002801355 |
Protein GI | 226946282 |
COG category | [S] Function unknown |
COG ID | [COG2187] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0138648 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCTTT CCGCCCGACA GCCTCGACAA GGACTGACCG TGAGCCAAGC CCTGATCAAC GCCCTGCAGA ACCCCGCGCT CTATCCGCAC GCGGTGACTG AGTTCAAGGT CATCGAGACC CACATCTCCT GGGTCCTGCT CACCGGCCCC TATGCCTACA AGTTCAAGAA AGCCGTCGAC TTCGGCTTCC TCGACTTCAC CGGCCTGGCG GCACGCAAGC ACTTCTGCGA GGAAGAGGTG CGCCTGAACC AGCGCCTGAC CCGCGACCTT TACCTCGAAG TCCTGCCGAT CACCGGCAGC GAGAACGCCC CGCAACTGGC CGGCGACGGT CCGGTCATCG AATACACGGT CAAGATGCGC CAGTTCCCGC AGAGCCAGTT GCTCGGCGAA ATGCAGACCC GCGGCGAACT CGCCCCCGAG CACATCGACG CCCTGGCGGA ACGGATCGCC AGCTTCCACC TGCAGGCGCC GAAGGTGCCG GCGGAACATC CGCTGGGCGG AGCGGAAGCC GTGATGAGCC CGGTGCGGCA GAACTTCGAG CAGGTTCGCC CGCTGCTCGC CGAACCGGCC GACCTGCAGC AGCTCGATGC CCTTGAGGCC TGGGCCGAGA CCAGCTTCGA GCGCCTGCGG CCGCTGCTCG AGCAGCGCAA GGCCGACGGC TTCATCCGCG AATGCCATGG CGACCTGCAC CTCGGCAACG CCACCCTGCT CGACGGGCAG GTCGTGCTGT TCGACTGCAT CGAGTTCAAC GAGCCGTTCC GCCTGATCGA CGTCGTGTGC GACGCCGCCT TTCTCGCCAT GGACCTGGAG GACCGCGGCC TCAAGGGGTT CTCCCGGCGC TTCGTCAGCC GCTGGCTGGA GTTGACCGGC GACTACGCGG CCCTGCCGCT GCTGAACTTC TACAAGGCCT ACCGCGCCAT GGTCCGCGGC AAGGTCAACC TGTTCCGCCT GGCCCAGGAA GAGAGCGACG AGCGACGCGC CGCCATCCTC CGGCAGTACC GCAACTACGC CAGCCTGGCG GAACGCTACA GCGCCATTCC CTCGTGTTTT CTGGCCATCA CCCACGGAGT GTCCGCCGTC GGCAAGAGCT ATGTCGCCCA GCGCCTGACG GAAGAGTTCG GTGCCATCCG CCTGCGTTCG GACGTGGAAC GCAAGCGCCT GTTCGGCGAA CAGCCGGCGG CCGATCGGGA GCGGCTGACC AGCGGTATCT ACAGCGCACA GGCCAGCACG GCGACCTACG AACACCTGCA TCGGCTGGCT GCCGGCGCCC TGCAGGCAGG CTTTCCGGTG GCGATCGACG CCACCTACCT GAAGGAGGCA CAACGCGCGG CCGCCAGGCA GGTCGCCGAG AACAACGGTG CCCCCTTCCT GATCGTCGAC TGCCAGGCCC CCGAAGCCCT GATCGCCGAA TGGCTCACCG AGCGTCGGGC TGCCGGGAAG GATCCGTCCG ATGCCACGCT GGAGGTCATC CGGGCGCAAC AGGCCGGCCG CGAACCCCTG ACCGAGGCCG AGCAGCAACG CAGCCGACGG GTCGATACCC ACCTTACCGC CAGTCTCGAC GATCTGGTGG CGCGCATGCG CAATCACCTC CCCCACCTTT GA
|
Protein sequence | MRLSARQPRQ GLTVSQALIN ALQNPALYPH AVTEFKVIET HISWVLLTGP YAYKFKKAVD FGFLDFTGLA ARKHFCEEEV RLNQRLTRDL YLEVLPITGS ENAPQLAGDG PVIEYTVKMR QFPQSQLLGE MQTRGELAPE HIDALAERIA SFHLQAPKVP AEHPLGGAEA VMSPVRQNFE QVRPLLAEPA DLQQLDALEA WAETSFERLR PLLEQRKADG FIRECHGDLH LGNATLLDGQ VVLFDCIEFN EPFRLIDVVC DAAFLAMDLE DRGLKGFSRR FVSRWLELTG DYAALPLLNF YKAYRAMVRG KVNLFRLAQE ESDERRAAIL RQYRNYASLA ERYSAIPSCF LAITHGVSAV GKSYVAQRLT EEFGAIRLRS DVERKRLFGE QPAADRERLT SGIYSAQAST ATYEHLHRLA AGALQAGFPV AIDATYLKEA QRAAARQVAE NNGAPFLIVD CQAPEALIAE WLTERRAAGK DPSDATLEVI RAQQAGREPL TEAEQQRSRR VDTHLTASLD DLVARMRNHL PHL
|
| |