Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_39050 |
Symbol | |
ID | 7762794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3953941 |
End bp | 3955161 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643806768 |
Product | hypothetical protein |
Protein accession | YP_002801020 |
Protein GI | 226945947 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.121024 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGCA GCCTGCATCA GGCCGGGGTG CGCTTTCTGC GCCTGCTGGG CCGGCATGCC GAGCCGATCA TGGACGCCTA CCTGGCCGGC TCGGTGACGG ATCAGGCGCT GGAGCCGGCG GTCGAGGAGC GGCTGGTCAG GGACGGTATC CTCTACCGTC CCGAGCCCGG CGCCGACCTG CACCTGCGCC GGGCGGTGCG CGCGCTGCTG GAGGAGGCCC TGCGCGACGA CCGCAACCGG CAGATCGACG CCAACGCCGG CGCCGCCCTG GCCACCTTCA AGACCCTCGC CGCCCACTAC AAGGAGGCGC GCCACCAGGG CGACTACGCG GCCGCCGACG CTTACCTGGG CGAACTGCGC GAGCACGTCT ACGCCTTCGG CGAGACCCTG GGCCACGGCA TTCGCGTGCT GTGGAGCCGG ATCAACAACG AGTTCGGCTA CGTCGGCACC CTCAACGCCA AGATCCGCGA GAACGAGTTG GCCCAGTCCC AGGTGAGCGA ACTGCTCGCC GGGCTGGAGC TGATCAGCTT CGAGGAACTG GCCGAGACCG CCGGCGACCT GCGCGAGCTG CGCCGCCTGC TGGTCACCAG CCTGCAGCGC ACGGTCAGCG CCTGCTCCCA GGAGCTGAGC GTGGTCCAGG GCCGCCTGCT CGAACTGCTC GGCCGCTTCC GCCAGATCCG CGGCCGCACC CGTCTGCTCA AGGGCTGGCT GCTGCACATG GAGCAGCAGC CGGACTACCG GGTGGGCAAC CACGCCGCCC AGCCGCAGGT CCCGCAACTG TTCAACCAGG CCCCGGCGAT CCTCGCTCCG GCCGCGGTGG ACGTCCACAA CCCGTCGCAG GAGGAGGTCC TGCTCGCCCT GGTGGCCCAG GCGCGCAGTC TGCAGCCGGC CGAGCGCCTG GGCCAGGCGC CGGGCGAGGC CGGCGAATTC GTGCTCGGCG CGCCCGAGGA CTTCGAGGTC GTCGCCAACC CGATCCGCGC GGCCGTCGAG GCCTACTTCT GCCGGATCAT CGACGGTGGC GAACGGCTTT CGGCCCTGGA GTACCGGGCG CAGCACGAAC TGCCCTGGGA TGCGGAAAGC TGGCTGTACC AGGTGATCGG CGGCTACGAG GGGCTGCCGG AGGAGCAGAA GCGCCACTTC GAGCTGGACC CCATCGGCGA GCCGCATCCG GTCTACTCGG GCAATTTCAT CGTGCGGGAC GTCAGGCTGT GGCTGGCCTG A
|
Protein sequence | MSGSLHQAGV RFLRLLGRHA EPIMDAYLAG SVTDQALEPA VEERLVRDGI LYRPEPGADL HLRRAVRALL EEALRDDRNR QIDANAGAAL ATFKTLAAHY KEARHQGDYA AADAYLGELR EHVYAFGETL GHGIRVLWSR INNEFGYVGT LNAKIRENEL AQSQVSELLA GLELISFEEL AETAGDLREL RRLLVTSLQR TVSACSQELS VVQGRLLELL GRFRQIRGRT RLLKGWLLHM EQQPDYRVGN HAAQPQVPQL FNQAPAILAP AAVDVHNPSQ EEVLLALVAQ ARSLQPAERL GQAPGEAGEF VLGAPEDFEV VANPIRAAVE AYFCRIIDGG ERLSALEYRA QHELPWDAES WLYQVIGGYE GLPEEQKRHF ELDPIGEPHP VYSGNFIVRD VRLWLA
|
| |