Gene Information |
Locus tag | Avin_40880 |
Symbol | |
ID | 7762973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4123650 |
End bp | 4125512 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643806947 |
Product | hypothetical protein |
Protein accession | YP_002801198 |
Protein GI | 226946125 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
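The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. Both are assumptions about how this record reports its values, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4123650, 4125512)
print(length, protein_length(length))
```

Under those assumptions the results agree with the Gene Length (1863 bp) and Protein Length (620 aa) fields.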
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.276542 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAAAC GCCTGCTTCC CCTTCTGCTG GGCGGCGCCT GCGCCACGCT CCAGGCCGAT CCGCAACCAT CGGCGGCGCG TCTGCACGAG TTGGCCGCGG AACCCTTCTG GCTCGCTCTC GGCCACTACC AGCGCGACCG ATTGGGCGGC TGGCGCAGCC ACGTCGACGA CGGCGGATTC TTCCTGGCCG AGCGGGGCGA ACGCGATCCG GCGGCGGAAC TCGCCGCCAC CCTGGTCGCC CTCTACCGGG ACCCGGCCCT CGCCGACCGC CATCCGCAGT GCCGCTTCCC GGCCCGCACC CGCTGGCTGC GCGAGCGCCT GGCGCTGGAC GACCTGCCGC AGCCGCAGTG CCGCGAATAC ACCGACTGGT ACGCCGACAT CGCCCCGCAC AGCACGGTAC TGGTGTTCCC CGCCGCCTAC CTGAACAGCC CCTCGTCGAT GTTCGGCCAC ACCCTGCTAC GCATCGATCC GCCCGGCATC GACAGCCAGG GCAGCCCGCT GCTCAGCTAT GCGCTGAACT TCGGCGCCTA CATCGAGGGC AGCGACAACA GCATCCTCTA TGCCTGGAAG GGCCTGATGG GCGGTTATCC CGGTCTCTTC GCCCTGCTGC CCTACCGGGA AAAGCTCGCC GAATACAGCC GCCTGGAAAA TCGCGATCTC TGGGAATACC GCCTGAACCT CAGCGCCGAG GAAACCGGGC GGATGGTCGA GCACGTCTGG GAACTGCGGC AGATCCAGTT CGACTACTAC TTCTTCGACG AAAACTGCTC CTACCGCCTG CTCGAGTTGC TGGAAATCGC CCGTCCGGGC CTGGAGCTGA CCGACCGTTT CCCGCTCACC GCCATCCCCG CCGACACCGT GCGCGCAGTG GAGCAGGCCG GGCTGATCGA GCGCAGCGCC TACCGTCCCT CGCGGGAGAA GGAACTGCTC GCGCGCGCCG ACCCGCTGAG CCGCCCCGAG CGCGACTGGA CGCGGCGCCT GGCCGGGGAC AGCACCGCGC TCGACGCTGC GGCCTTCCAG GCGTTGCCGG CCGAACGCCG GGCACTGGTA CAGGACGCCG CCTACCGGCT GATCCGCTAC CGCAAGAACG GCGAGGAACG CGACGACACC AGCGCCCGCG ACAGCTACCG CCTGCTGCAG GCGATCAACC GCGCGCCGCC GCCGCGCCTC GCCGTGGAGC GCCCGGTGCC GCCGGAAAAC GGCCACCAGT CGCGCACCTG GCAGCTCGCC CTCGGCAGCC GCGAAGACCG GAGCTTCGCC GAATACGGCC TGCGCATGGC CTACCATGAC CTGGCGGACA ACCTGGATGG CTTCCCGCTC GGCGCGCAGA TCGAGATCGC CCAGCTCAAG CTGCGCCAGT ACGAGGGCGA CCACTGGCAA CTGCAGCGGC TGGATGTGGC CAACATCCGC TCGCTGACCC CGCGCAGCAC TCTGCTCTCG CCCCTGTCCT GGCAGGTCGG CGGCGGTCTG GAGCGGGTCG TCGGCAAGGG ACGTCACGAC GACGAGCAAC TGGTCGGCCA TCTGAACGGC GGCGTCGGCG CCACCTGGCA CCTCGCCGAC GACCTGCTCG GCTTCGCCCT GGCCACCGCC CGGCTGGAGC GCAACGGGGA CTTCGCCGCC TTCCTCAGCC CGGCGGCCGG CTTCGATGCC GGCCTGCTGT GGCGCAACCG CCTGGGCAAC CTCGGCCTCG AGGCCCGCGC CGACTACTTC CACAACGGCG AGGTACGCCG CGAGCTGAGC CTGGTCCAGC AATGGGAAAT CGCCGGCAAC CTCGGCCTGC GCCTGGCCGC CCGGCGCGAA 
TTCAGCCAGC AGGGCGCGCC GGCCAGCGAA CTGAGCCTGC AGTTGCGCTG GTATCACTAT TGA
|
Protein sequence | MFKRLLPLLL GGACATLQAD PQPSAARLHE LAAEPFWLAL GHYQRDRLGG WRSHVDDGGF FLAERGERDP AAELAATLVA LYRDPALADR HPQCRFPART RWLRERLALD DLPQPQCREY TDWYADIAPH STVLVFPAAY LNSPSSMFGH TLLRIDPPGI DSQGSPLLSY ALNFGAYIEG SDNSILYAWK GLMGGYPGLF ALLPYREKLA EYSRLENRDL WEYRLNLSAE ETGRMVEHVW ELRQIQFDYY FFDENCSYRL LELLEIARPG LELTDRFPLT AIPADTVRAV EQAGLIERSA YRPSREKELL ARADPLSRPE RDWTRRLAGD STALDAAAFQ ALPAERRALV QDAAYRLIRY RKNGEERDDT SARDSYRLLQ AINRAPPPRL AVERPVPPEN GHQSRTWQLA LGSREDRSFA EYGLRMAYHD LADNLDGFPL GAQIEIAQLK LRQYEGDHWQ LQRLDVANIR SLTPRSTLLS PLSWQVGGGL ERVVGKGRHD DEQLVGHLNG GVGATWHLAD DLLGFALATA RLERNGDFAA FLSPAAGFDA GLLWRNRLGN LGLEARADYF HNGEVRRELS LVQQWEIAGN LGLRLAARRE FSQQGAPASE LSLQLRWYHY
|
| |
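The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the short input shown is just the first two blocks of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence from this record.
fragment = clean_sequence("ATGTTCAAAC GCCTGCTTCC")
print(fragment, gc_fraction(fragment))
```

Passing the full Gene sequence field through `clean_sequence` first also removes the line break that falls inside it.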