Gene Information |
Locus tag | Avin_42450 |
Symbol | |
ID | 7763121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4272034 |
End bp | 4274349 |
Gene Length | 2316 bp |
Protein Length | 771 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643807096 |
Product | hypothetical protein |
Protein accession | YP_002801344 |
Protein GI | 226946271 |
COG category | [R] General function prediction only |
COG ID | [COG3008] Paraquat-inducible protein B |
TIGRFAM ID | |
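The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. A minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon (both are assumptions about how this record counts, although they agree with the listed values); the function names are illustrative only:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Avin_42450.
length = gene_length(4272034, 4274349)
print(length, protein_length(length))  # 2316 771
```

The strand ("-") does not affect either length; it only determines which strand of the replicon carries the coding sequence.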
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAATGACT TGCCCGAGGC GAGGACCCGT CCGGCCTCTA CCTGGTCGGC GATCTGGGTG TTGCCGTTGA TCGCCTTGCT GATCGGCTCC TGGCTTGCCT GGCGCGCCTA CGACCAGGCC GGCATCGAAG TTCAGGTGCG CTTCACCAGC GGCGAAGGTA TCCAGATCAA CAAGACCGAG GTGATCTACA AGGGTATGCC GGTCGGCAAG GTCGTAGGTC TTACCCTGGA CGACGAAGGC TCGAACACTG GGGTGATCGC CACGCTGGAG ATGAACAAGG ACGTCGAGTC CTACCTGCGC AGCAATACAC GCTTCTGGCT GGTCAAGCCA CGGGTTTCCC TGGCTGGCAT CACCGGTCTG GAAACCTTGG TTTCCGGGAA CTACATAGCG GTCAGCCCTG GAGATGGCGA GCCGAACAAG AAGTTCACCG CTCTGTCCGA AGAACCACCC CTGCCGGACA GCACGCCGGG ATTGCATATC ACCCTGAAAG CCGAGCGGCT GGGCTCGTTG AATCGAGACA GTCCCGTGTT TTACAAGCAG ATCCAGGTCG GCCGGGTAAA GAGCTACCAA CTGGCCGAGG ATCTGAGTAC CGTGGAGATC AGGATCTTCG TCGAGCCGGC CTATGCCCAT CTGGTACGCA AGCACACGCG CTTCTGGAAT GCCAGTGGAA TCACGGTCGA TGCCGGACTC GGCGGGGTCA AATTCCATAC CGAGTCCCTG GCCAGCATCG TGGCGGGCGG CATTGCCTTC GCCACGCCGG AGAACCGCAA GGACAGCCCG CCCACCGATC CGCGCCTGCC GTTTCGTCTC TATGACAATT TCGACGCGGC CCAGACCGGT ATCAAGGTCA TGCTCGAGCT GAGCGACTTC GAGGGACTCC AGGCCGGGCG CACCCCGGTG GTGTACAAAG GCATCCAGGT CGGCATCCTG AAAACCTTGC AGATCGAGCC CGGTCTTTCC CGCGCCATGG CCGAGCTGAC TCTGGATCCG CTGGCCGAAG ATTTCCTGGT CGAGGGTGCC GATTTCTGGG TGGTCAAGCC CTCCATCTCC CTGGGCGGGG TCACCGGCCT CGAAGCGCTG GTGAAGGGCA ATTACATCGG CATGCGTCCG GGCGAGAAGG GGGCGTCGGT CAGGCGTTCC TTCGTTGCCC GCAGCAAGGC GCCGCCCATG GATCTCGGCG CTCCCGGTCT GCACTTGGTG CTGACCAGCG ATACCCTCGG TTCGCTGGAT ATCGGTAGTC CGGTGCTCTA CCGGCAGATC AAGGTCGGTT CGGTGCAGAG TTTCCAGTTG TCCCGCAATC GCCGCCGTGT GCTGCTGGGC GTGCACATCG AGCCGGAATA CGCCAGTCTG GTGAACAGCT CGACGCGCTT CTGGAATGCC AGTGGCGTCA CCTTGAGCGG TGGTCTGTCC GGAATCGAGG TGAAGAGCGA GTCCCTGCAG ACCTTGCTGG CCGGCGGCAT CGCTTTCGAA ACTCCCGATC CCGAGGCGTC CGCCAACACG CGGCGGATTC CGCGCTACGC CCTGCACGCC GATCGCGAGA CCGCCCTGCA GGCCGGGCTG GAGCTGCAGA TCCGCGTCGA CAGCGGTGAC GGGCTGAAAG CCGGCACGCC GATCCGCTAC AAGGGGCTGG ATGTCGGCAA GGTGGAAGGG GTCGAACTGA GCGACGACCT GCAATCGGTC CTGCTCCGCG CCCGGATCAC CCAGGCAGCC GAGCGCATCG CCCGTGTCGG CAGCCAGTTC TGGGTGGTCA GGCCCGAACT CGGCCTGATG CGTACCGCCA ACCTGGATAC CCTGATCAGC 
GGCCCCTACA TCGAGGTGCG GCCGGATGCC GGCAAGTCCT CCCGGCAGAC GAGTTTCGTC GCCCTGTCGC GCGCGCCGGA AAGCGCCGCC AAGCCGGAGC CCGGCCTTCG GCTGGTGCTC AGTTCGCCGC GGCGCGGTTC GCTCAAGGCG GGCGTACCGG TCACCTACCG CGAGGTGACG GTCGGCAAGG TCACCGGATT CGAACTGGGG CCGAACGCCG ACCGGGTGCT GATCGGCATT CTCATCGAAC CGCGCTATGC GCCTTTGGTG CGCAGTGGCA GCCGCTTCTG GAACGCCAGT GGCTTCGGTT TCGATTTCAG CCTGCTCAAG GGGGCGCAAC TGCGTACCGA GTCGCTGGAA ACCCTGCTCG AAGGCGGCAT CGCCTTCGCC ACGCCCGACG GCGAACGGAT GGGCAAGCCG GCGCTTCCGG GGCAGACCTT CCCGCTATTC TCCGAAGCGG ACGGCGAATG GCTGCAATGG GCGCCGAAGA TCGCTCTGGA AAAGGAGAGG AAGTGA
Protein sequence | MNDLPEARTR PASTWSAIWV LPLIALLIGS WLAWRAYDQA GIEVQVRFTS GEGIQINKTE VIYKGMPVGK VVGLTLDDEG SNTGVIATLE MNKDVESYLR SNTRFWLVKP RVSLAGITGL ETLVSGNYIA VSPGDGEPNK KFTALSEEPP LPDSTPGLHI TLKAERLGSL NRDSPVFYKQ IQVGRVKSYQ LAEDLSTVEI RIFVEPAYAH LVRKHTRFWN ASGITVDAGL GGVKFHTESL ASIVAGGIAF ATPENRKDSP PTDPRLPFRL YDNFDAAQTG IKVMLELSDF EGLQAGRTPV VYKGIQVGIL KTLQIEPGLS RAMAELTLDP LAEDFLVEGA DFWVVKPSIS LGGVTGLEAL VKGNYIGMRP GEKGASVRRS FVARSKAPPM DLGAPGLHLV LTSDTLGSLD IGSPVLYRQI KVGSVQSFQL SRNRRRVLLG VHIEPEYASL VNSSTRFWNA SGVTLSGGLS GIEVKSESLQ TLLAGGIAFE TPDPEASANT RRIPRYALHA DRETALQAGL ELQIRVDSGD GLKAGTPIRY KGLDVGKVEG VELSDDLQSV LLRARITQAA ERIARVGSQF WVVRPELGLM RTANLDTLIS GPYIEVRPDA GKSSRQTSFV ALSRAPESAA KPEPGLRLVL SSPRRGSLKA GVPVTYREVT VGKVTGFELG PNADRVLIGI LIEPRYAPLV RSGSRFWNAS GFGFDFSLLK GAQLRTESLE TLLEGGIAFA TPDGERMGKP ALPGQTFPLF SEADGEWLQW APKIALEKER K
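The protein sequence corresponds to a codon-by-codon translation of the gene sequence, which is listed here in coding orientation (it begins with ATG and ends with TGA). The following is a hedged sketch rather than the database's own method: it assumes the amino acid assignments of translation table 11 match the standard genetic code (table 11 differs in which start codons are permitted), and it does not handle alternative start codons or ambiguous bases. `translate` is an illustrative name.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Any character other than A, C, G, or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAATGACT TGCCCGAGGC GAGGACCCGT"))  # MNDLPEARTR
```

Applied to the final 15 bases of the gene sequence (AAGGAGAGGAAGTGA), the same function yields KERK, which matches the C-terminal residues of the protein sequence.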