Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_15980 |
Symbol | |
ID | 7760533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1573417 |
End bp | 1576965 |
Gene Length | 3549 bp |
Protein Length | 1182 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643804498 |
Product | Glycosyl transferase, family 2 |
Protein accession | YP_002798788 |
Protein GI | 226943715 |
COG category | [S] Function unknown |
COG ID | [COG3551] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.967945 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCAGA AAAATATAGT TGTCGTACTG GGCATGCATC GAAGCGGTAC TTCTGTAATC ACGCGAGGTC TGCAGGCGCT GGGTGTTCAG CTTGGTACCC GCTTGATGCC GGCAGCACCC GGAAACAATG AAAAGGGTTT CTGGGAGGAT ATGGATGTCA ATGCACTCAA CATCGAGTTG CTCGCTGCCC TGGGCCAGGA CTGGCATACC CTGACACCTT TGCTGCCGGA GCATTTGAAC CCTTCGGTTC TCGACTCCTT CAAACTGCAA GCCGTGCAAT TGTTGAGGGA AAAGTTATCC GGTGTCGAAA GTTTTGGTCT GAAGGACCCC CGGATCGCCA GGCTGCTGCC ATTCTGGACG AGCATATTTG CCCATCTGGA TGTTCAGGTT CGCTATGTGG TCGCCTGCCG CCATCCTATG AGTGTGGCTC GCTCGCTGGC CAAGCGTGAT GGTTTCGCCT TGGAAAAAGC CTATCAGTTA TGGCTGGAAC ACATGCTGGC AAGCTTGGCG GGAACGCAAA ATCGTCCTCG GGTCGTTGTC GACTATGACC TTCTCATGGA AGAGCCTGCC GTTCAGTTGC AGCGGATCGC GCAAGCACTG GATCTTGAGT TCGACGCGGA CAGTCAGGCT TTTGCCGAGT ACAAGGAAGA ATTCTTGGAG GAAGGACTCC GGCATAGCCG TTTCGCAGCG GCCGATCTGT GTCTGGACAA AGCAGCTTCT CCGCAGGTGG AGGAGTTGTA CGACGCTCTG CTGCAGATGG CCAGTGATCG GATAGCCCCC GATATACAAG AGGTAGAAGG GCTTATTGCG GCGTTGTCCG AGGAGTTACA GCGGAGTTAT TCGCTGCTGC ATTACACCGA TCTATGCGAG ATACGGATTG GCGAATTCAC CAGTCAAGTG GCTATTCAAG AAAGACAAAT AGCCGATGGT AACGCGCAGG TTGATACCTT GAATCAGACC ATTGCCGAGC GCGAAGGTCA GATTCACACG CTGAATCAGT TAGTGGCAGA AAGGGATGGG CAAATTGGCG GTCTGAACGG GATTGTCGCC GAGCGCGATG GCCAGATTCA TACATTGAAT CAGGTAATCT TGGAAAGGGA AGGGCAGATT GGTGACCTGA ACGAGGTTAT CGCCGAGGGC AACGGCCAGA TCCATGCGCT GAATCAGGTA ATCGCAGAAA AGGAAGAGCA GGTTGGTAGC CTGAACGGGG TTTTAGCCGA GCGCGATGGT CAAATTCGTA TATTGAATCA GGTAATCGCA GAAAGGGAAG AACAGATTGA CGGCTTGAAT AGGATTGTCG TCGAGCGTGA TGACCAGATT CATGCGTTGG ATCAGGTAAT CGCCGAAAGG GAAGGGCAGA TTGGCGGCTT GAATAGGATT GTCGCCGAGC ACAATGGCCA GATTCATGCG TTGAATCGGA TAGTCTCGGA AAGGGAAGGG CAGATTGGCG GCCTGAACGA GGTAGTTGCC GGGCGTGATA GCCATATTCA AGAGCTGAGT CGGATTGTGC ATGAGCGAGA TATCCAGATT GCCGCCATGC AGGAAATACA GTATGCCCTA CACAACTCCA GAAGTTGGCG TATCACTGCT CCTTTGCGTC GTGCCAACCA TCTGAGACGC AAGGTTGTGG ACGTTAAAGC TATCGCCTGT CGCCGGCTAC GAGATGAGTC TTTATCGACT CTTCTGAAAA AGACGTTGGG AATCTTGCGT CGGGAAGGAC TGAATGGGCT AAGAGCGAGA ATTCGCCATC AGCATTATCT GTCGACACTA GCCCCTGCGA CTCCTGCACC GCAGCTCTCA CCAACTCCTC CCTCCGCGGG AAATATGACC TCCATACCTA TGGCCATTGT GCGCCATTCT GGGGGACGCT ACGAATTGGC CGCTACCTCT AAAGGCTATA CCTATATCGA GCCACAGCGG CCTGCCGACT TGGGAGCCCG ACTCGCAATT CTCGACATTA CGCCTTTGTT CTCGATCGTG GTACCTGCCT ATAACACCAG CTCAGAACTA CTGGATGCCG TACTCTCTTC AGTACGGGCC CAGTGGTATC CTCACTGGGA GCTTATTTTG GTTGACGACG CCAGCCCTTC CGAAGAAACT CGGCGGGCAT TGGCCGAGAT AGACGATCCG AAAATCAGGG TTCTGCACCT GGAGAGCAAT AAAGGTATTT CCGGTGCCAC CAATGTAGGC TTGGCTGCCG CTCAAGGCGA GTTCATCGTG TTCATGGATC ACGATGACGA ACTAACCGTC GATTGCTTGT ATGAGTTGGC ATTATGCATC AATCGCGACC AGCCGGATTT TATCTATAGT GATGAGGATA AACTTACCGA GGAAGGGGAA TATACCCAGC CACATTTCAA GCCGGATTGG TCTCCCGATA CCATGATGAG CACCATGTTC ACCTGCCATG TATCTTGCGT GCGTCGCAGT CTGTTGAGCA AAGTGGGTGA ACTGCGTTCG GAGTTTGATG GCTGTCAGGA TTGGGATTTT ATCTTGCGTG TCGTCGAGCA TACGAATCGT ATAAGTCATA TTCCGAAGGT TCTTTATCAT TGGCGAATAA TTCCGGCTTC CGTCGCATCC GATATCTCCG CCAAGCCTTA TGTACTTGAA GCGTCCCGGC GTGTTCGACT GGATGCGTTG GAGCGTCGAG GCCTCAAAGG GAGTATAGAG CCGGTAGCTC AAGTTCCAGG ATATTTTCGT GTCAACTATC ACCTGCAAGG TTCGCCCCTG ATTTCGATTA TCATTCCTAG TCGGGATAAT GGTTCGGTTC TTCGCCGTTG TCTGGATTCC ATCCAGGAAA AAAGTAGCTA TCGGAATTTT GAAATCATCA TCCTTGATAA TGGTTCTGTT GAGGCTTCGA CTGTTGCTTA TCTGAAGGAG TTGCAAGAAA AAGGGGTAGC GCAGATTATT CGTCATGATG CTCCATTCAA TTTCTCCGAG CTTAATAATA TCGGTGCCAG GACTGCTGGT GGCGAATTGC TGTTGTTCCT CAACGATGAT ACCGAAGTGC TTTGCAATGA CTGGCTGGAG CGCATGGGAG GGTATGCTCA GTTAGTACAT ATTGGTGCTG TAGGGGCCAA ATTGCTTTAC CCGGATAGCT CTGAAATCCA ACATGCGGGT GTCCTCAATT TGGCGAATGG CCCTGTTCAT GCGTTCCTGC GTCATCATAG TGAGCGCCCA GGCTATTTTA TGCGTAATTT GTTGGAGTAC AACTGGCTGG CAGTCACTGG CGCCTGCTTG ATGATGGAGG CTTATAAGTT CAATGAGTTG GGGGGCTTCG ATGAAACCCT GCCGGTTGCT TATAACGATA TTGAGTTGTG TATAAGAGCT GTTGAGAAGG GCTATTATAA TGTGGTGTGT CAATCAGTGA CTCTGATCCA TCATGAGTCG GTCAGTCGAG GCCTTGATCA TGTCGATCCT GTAAAATTCG CACGTTTACA GAGAGAGCTT CGGCGTCTTT ATGATATGCA TCCCATGTTT TTTCAATATG ATCCTTTCTA TAATCCGAAC TTGCATCCAA ATGGGATTAA TTTTGAGGTG GCTTTATAA
|
Protein sequence | MEQKNIVVVL GMHRSGTSVI TRGLQALGVQ LGTRLMPAAP GNNEKGFWED MDVNALNIEL LAALGQDWHT LTPLLPEHLN PSVLDSFKLQ AVQLLREKLS GVESFGLKDP RIARLLPFWT SIFAHLDVQV RYVVACRHPM SVARSLAKRD GFALEKAYQL WLEHMLASLA GTQNRPRVVV DYDLLMEEPA VQLQRIAQAL DLEFDADSQA FAEYKEEFLE EGLRHSRFAA ADLCLDKAAS PQVEELYDAL LQMASDRIAP DIQEVEGLIA ALSEELQRSY SLLHYTDLCE IRIGEFTSQV AIQERQIADG NAQVDTLNQT IAEREGQIHT LNQLVAERDG QIGGLNGIVA ERDGQIHTLN QVILEREGQI GDLNEVIAEG NGQIHALNQV IAEKEEQVGS LNGVLAERDG QIRILNQVIA EREEQIDGLN RIVVERDDQI HALDQVIAER EGQIGGLNRI VAEHNGQIHA LNRIVSEREG QIGGLNEVVA GRDSHIQELS RIVHERDIQI AAMQEIQYAL HNSRSWRITA PLRRANHLRR KVVDVKAIAC RRLRDESLST LLKKTLGILR REGLNGLRAR IRHQHYLSTL APATPAPQLS PTPPSAGNMT SIPMAIVRHS GGRYELAATS KGYTYIEPQR PADLGARLAI LDITPLFSIV VPAYNTSSEL LDAVLSSVRA QWYPHWELIL VDDASPSEET RRALAEIDDP KIRVLHLESN KGISGATNVG LAAAQGEFIV FMDHDDELTV DCLYELALCI NRDQPDFIYS DEDKLTEEGE YTQPHFKPDW SPDTMMSTMF TCHVSCVRRS LLSKVGELRS EFDGCQDWDF ILRVVEHTNR ISHIPKVLYH WRIIPASVAS DISAKPYVLE ASRRVRLDAL ERRGLKGSIE PVAQVPGYFR VNYHLQGSPL ISIIIPSRDN GSVLRRCLDS IQEKSSYRNF EIIILDNGSV EASTVAYLKE LQEKGVAQII RHDAPFNFSE LNNIGARTAG GELLLFLNDD TEVLCNDWLE RMGGYAQLVH IGAVGAKLLY PDSSEIQHAG VLNLANGPVH AFLRHHSERP GYFMRNLLEY NWLAVTGACL MMEAYKFNEL GGFDETLPVA YNDIELCIRA VEKGYYNVVC QSVTLIHHES VSRGLDHVDP VKFARLQREL RRLYDMHPMF FQYDPFYNPN LHPNGINFEV AL
|
| |