Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_49670 |
Symbol | |
ID | 7763820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5030828 |
End bp | 5033953 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643807801 |
Product | hypothetical protein |
Protein accession | YP_002802035 |
Protein GI | 226946962 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGGTG GAGTGAACGT GCCCCGTTTG GACAATCAAC AGGTCGAGTA TCTATGGGCT GCTGGCGATC ATGAGCCGGA TTGGAGGCCA GAGCTGCGGG AGCACCTGAA AACGATAGCT GAGGCGTCTG GAGTGCCCCT GGGGCAGTGG TTATTGGGAA AGTGTTCCAT GCGCGAGGTG TCGGCCGAGC ACGTGCGTGA CCTGGAGGCG CAGCTTGGTA CGAATCACGC GCTACGGAAT ATGCTGCGCA CCATCACCCA ACGACTCAGG CGGCAGATTC CAGGTTTTCC ATTGGCCCGT ATCGCCATTG CCGTCGAACG CCCAGCGAAC CCGATCAACC CCGACATCTA CACGGCTGAA CGACTACGCA GAAGTCGCCA ACTGCTCAAT GTACTGCAGC AGGGATTGAG GTTGGATCTC GATTCGTTCC GCCCAGAGGA GCGCATCGGT CTGCTATTGA TGAGCGCGGC TTATGGCGGC GGCCTGATGG ACATTGCTCA ACTGAATGCG CTGGTCGAGG TGTCCCTGGA GCGAATCGAG TGGATCGCAG GCATTCCGGA GCTGAGATTG CCGCTCTCCA TCCGCGGCAA GGTGCAGGCG GAACACCGGC AATGGTTCCC GGACCCGGCC ACGCTGGCTC TGTTGACCCG CTGTTCGGAC GATATGCGAG CAATGGGCGC TCGATTGAGG CGCAGAGAAA CTGTTCTCAG GTGTATTCGA GCGTTTCTCG GAAGGAGCGG TGTACCGAGT CGGGATTTGC CGACCAGCCT GACGGAGCTG CTGGATTTGC TGCGGATGCA GATGCAATTG CGCCTGCCTC AGATTCTGGT GAATTTCGCA TGTCGACATG GGTTCGTATC GCAGTCGCTG AGACCATCGT CCTGGGGGGA AATGTTCGGC TATCCAGGCT TGGAAGATCC GGTTGGCACC TATGGAGATA GTGTCAGATC GGACGAGTAC GGAGAGGAAA AAACCGATAC TCCCGATTGG GTGCTGGATC TGTGTCGGCA GATCCGGGCA GGTGACCCTG TTGATCCATC CCCAGCGACT GAGGATTCTG AATCACTTGA GGTGCTTATC AGAGAGTGGG CTGCCTATCT GGTAGGTGGA TCGTCCGCAT ATGGTCACGA CATCGGGCGC AGTAGCATCA CTCGGTACGC CCGACTGCTT GGGGAGGCGT TGGCGTCGCA ATTGGATGGC CAGAGTGTTT TCCAGATGGA GCCCGATGCA CTGGAGATTG TCTATGAGAC GGTCCTCGAT GCCCAGATAA CCGATAGCAA GCGGCGCACC TTGGCCAAGG CGATTCATGA GTTCCACGCA TTTCTGGGGC GTCGCTATCA CTATCCACCG ATCAGCCCGT ACTCGGTATT GGGCATCGGC AGGGATGTGG CTAGCGTTGA TGCCCGGATC CTCTCCGAGG ACCAATATCA GGCCGTTTTG CGCGCCTTGG ACACCAGCGG GCTGGAATTA CGGACCCCGC GTCTGGTCAC GGCTGCCAAG CTGTTTCTGA TCCTGGGATT CAGGTTGGGC TTGCGGCGCA ATGAGGCCCT GAAGCTGCGG CTCAGTGATC TACATTTGCC CGAGTTATCG AGTGACGCCC GCGAACGTAT TCGCGGGCGT CATCCGGAGA TGCGGATCCT CTCCAACCAG GAGCTGGCAG GGTTGGAGCT GCCGGTCGAC CTGCTTGTAC GGCCACATGC ACAGCGCGGC CTGAAAACCC AGAACTCGGT TCGGCGGTTA CCGCTCCGCC TATTGTTGGA GCCCGAAGAA CTGGAACTGT TGATGGTCTG GTACCAGCAA CGGCAAGCAG AAGAGACACG GGCGCCTTCA TCGGAGTTTC TATTCTGCAT CCCGGAGCTG AGAACCCAGT GGGTCAGCGA AAGCACGCTG TTGCCGGCAT TGCATGCCTG CATGCGTGCA GTCACAGGTT CCGAGGTCAT TCACTATCAC CACCTGAGGC ATTCCTGCGC CACTTGGCAG ATGCTCAAAC TGATGGGAAC CATTACCGAC TCAGCGCCGG AGTTGATATT CCGTGATCTG CCCCTGACCA CTCGATGGCT CAGCGACAAT GCCAGGCAGC GTGAAGCGCT GATATCTGCC AACGGTGGAC CCACACGGCG TATCGTCCAT ATCGTCAGTG CCTTGCTCGG TCACGGTAGC CCCAAAACTT CGCTGCTGCA CTACATCCAC AGCCTACCTC TGGTAATGGC TCAGGCCTGG CAGTGGAACC CCAGAGTCTG GCTTTTCAGT GCCCATAACG TCGCCTCTAT CGCCAAGGTC AGCCTGCCCA CGACGGAGGC CAGTTCCGTT GGTGGTCCGG AGCATCTGTT GCGGGTCATC GGCCGGATCA GGTCGCTCAA GGCCAAAAGA CGGCCTCGAC GGACTGCGGT TTGTTTCGCC GTTCAGCAGG TCGAAAACAA CTGGGCGATC GAGCGGATTC GCCGGATCGA GTCGATGCTG GCATACGCAT CTTATGTCGA GAGTAGCGGT CGGCAGATCA ATCTGGAGTG GCTGGAGTTC GCGACAGAAG AGCGGAGCAT GATGCTGGAT CGAGCTCAGT ATATCCGCAG CTTGACGCAG AGATCCCAGC CGGAGGCTGG GGGCAAGCAC CGGCTGCGAG CCTCTATGCA GGCTGAGACG TCATCGCTCA TACCAATGCC TCCTCGACAT GGTGGAAGAG ATGCCGTAGC AGGGTATGCA GAACGTCTCT ACGAGTTACT GGATGGAGCG GAGAGCGAGC GCGCCAATCG AGCGATAGAC GATTTCGTCG AACGCTGTTG GGCAACGGAA ACGACGCTGC GCTTTTACCG TGATTGCGAC GAGGAGCATA CACGGGACTA CCTGTGGCTG CTCACAGCCA TCGGCGTTCC CGCCCAATCC ATCGAGTTGA TTATCTACGA CACACGAAAG CCCAGAGCCG CCAAGTCATA TTGGCGTCAG CAGCTCGGGA ATATTCGTCG GCCGATTAGC CAGCATGCGC CCGAAAATCC GGATGCCGAA AATACCCATC TTGGAATTCG GGCCACGCTG GCGCTAGAGG AGGGGCGGCA GCAGAACCGA CATTCGGGGG CAGCGCTGCG CTACCTATTC CTGATGGCCT CCATCGACTG GCATTTCCGG ACATGA
|
Protein sequence | MGGGVNVPRL DNQQVEYLWA AGDHEPDWRP ELREHLKTIA EASGVPLGQW LLGKCSMREV SAEHVRDLEA QLGTNHALRN MLRTITQRLR RQIPGFPLAR IAIAVERPAN PINPDIYTAE RLRRSRQLLN VLQQGLRLDL DSFRPEERIG LLLMSAAYGG GLMDIAQLNA LVEVSLERIE WIAGIPELRL PLSIRGKVQA EHRQWFPDPA TLALLTRCSD DMRAMGARLR RRETVLRCIR AFLGRSGVPS RDLPTSLTEL LDLLRMQMQL RLPQILVNFA CRHGFVSQSL RPSSWGEMFG YPGLEDPVGT YGDSVRSDEY GEEKTDTPDW VLDLCRQIRA GDPVDPSPAT EDSESLEVLI REWAAYLVGG SSAYGHDIGR SSITRYARLL GEALASQLDG QSVFQMEPDA LEIVYETVLD AQITDSKRRT LAKAIHEFHA FLGRRYHYPP ISPYSVLGIG RDVASVDARI LSEDQYQAVL RALDTSGLEL RTPRLVTAAK LFLILGFRLG LRRNEALKLR LSDLHLPELS SDARERIRGR HPEMRILSNQ ELAGLELPVD LLVRPHAQRG LKTQNSVRRL PLRLLLEPEE LELLMVWYQQ RQAEETRAPS SEFLFCIPEL RTQWVSESTL LPALHACMRA VTGSEVIHYH HLRHSCATWQ MLKLMGTITD SAPELIFRDL PLTTRWLSDN ARQREALISA NGGPTRRIVH IVSALLGHGS PKTSLLHYIH SLPLVMAQAW QWNPRVWLFS AHNVASIAKV SLPTTEASSV GGPEHLLRVI GRIRSLKAKR RPRRTAVCFA VQQVENNWAI ERIRRIESML AYASYVESSG RQINLEWLEF ATEERSMMLD RAQYIRSLTQ RSQPEAGGKH RLRASMQAET SSLIPMPPRH GGRDAVAGYA ERLYELLDGA ESERANRAID DFVERCWATE TTLRFYRDCD EEHTRDYLWL LTAIGVPAQS IELIIYDTRK PRAAKSYWRQ QLGNIRRPIS QHAPENPDAE NTHLGIRATL ALEEGRQQNR HSGAALRYLF LMASIDWHFR T
|
| |