Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_51410 |
Symbol | |
ID | 7763981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5224421 |
End bp | 5226622 |
Gene Length | 2202 bp |
Protein Length | 733 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643807960 |
Product | Glycoside hydrolase, clan GH-D |
Protein accession | YP_002802194 |
Protein GI | 226947121 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAAG CACCTATCAC TTCTCCATGC ATTCCGGAAA TCGAAACGGC CGGCCAAGGG GGCGGCCATC CTGAAACATT GATCCGGCTG GATGGTACGA ACGCTACCTT ATTGCTGATC CGTCGCCGGG CGGGGTTGCC GGAAATCGCG TATTGGGGGC GGCGCCTGCC GCAGGAAATC TCGGAAGCCG AGATCTTCGC CGTGCGTGCC CCGGACAGCC CCAATAACGG GCCGGACCAG TGGCGCCCGC TGATCACGCT GCTCGATACG ATCGGCTGCT GGAATTTCGA TATGCCCGGC CTGATCGCGG CGCGTCCCGA TGGAAGCGGC TGGACTGCGA ATTTCGAGAC CGAGCAGGTC GCACAGGAGG GCCGGCGGCT CGTGGTACGG GGGAAGGACA CCGTATCCGG GCTTGCGATA GTGCTGAGTC TGGAACTCGG ACCCGATGAC GTGCTGACCA GCCAGGCGCG GCTGACCAAT ACCGGCACGC AGAGCCTGCG GGTCGAGCGG CTTGTCTCGG GGACCTATCT GTTGCCCGAG TCGGTCGATA CCGCGCATGT GCTGTCCGGC GAATGGGGGA ATGAATTCGG CATCGAGCCG ATGAACCTGA CCCGTGGCGG GATCGTGGTG GAGAGCCGCC GCAACCGTGC GCACGATCAC TTCCCCGGCA TGCTGCTCTC GCCGGCCAGC ACCTCGGAGA ACGAGGGCGA AGCCTGGGCC GCACAGCTTG GCTGGAGCGG CGGTCATCGT CTTTGCGTGG AGCGGATGGA GGACGGACGC ATCCGTTTGT CATGCGGCGA ATATCTCTAT TCGGGTGAGG GCGATCTCGC GCCTGGCGCC GAGCTGGTTA CACCGGTTGC CTATGCCACC TATTCGTCCG CCGGGTTTTC CGGGTGTGCC CGAGCCTTCC AGGCCCATGC GCGTCGCCAC CTTCTGCACT GGCCGGGCGG CAAGATGAAA CCGCGCCCGG TCCTTTTGAA CTCCTGGGAG GGGAGCGGGT TCGACCTGCA TGAGGATCAA CTGATGCGCC AGGTGGACGC GGCGGCCGCA CTCGGCATCG AACGGTTCGT ACTGGATGAC GGCTGGTTCG GGGCACGACG CGATTGCGAT GCCGGGCTTG GCGACTGGTT CTCCGCCGCT TCGGTTTTTC CGAATGGTCT GCGTCCCCTG TCCGACAGGA TTCACGGGCA CGGCATGGAG TTCGGGCTCT GGTACGAGCC GGAGATGGTC AATCCGGACA GCGACCTGTA TCGCGCGCAT CCGGACTGGG TGCTGCAGAC GCGCGGCTAC CCTTTGTGGA CGTCCCGCAA CCAGTTGGTG CTGGATATTT CGCGGCCCGA GGTCTCGAAT TACCTGTTCG AAGTGATCTC CGAGCAGGTC GCCAGCGTGC GGATCGATTA CATCAAGTGG GATTTCAATC GCGATCTGGT CGAGGCATCC GATGCGCAAG GTCGCGCGGC CTACCGCCGC CAGGTCCTGG CGCTCTATGC GCTGTGGGAG CGGCTGCACA AGGCGCATCC CGGCCTGGAG ATCGAAAGCT GCGCATCCGG CGGCGGCCGG GCTGACTGGG GCGCGCTGGC ACATACGCAA CGCGTCTGGA CTTCGGACGA CACCGATGCG CTGGAGCGTC TCGCGATCCA GGGCGCGGCA TGGCATTTCC TGCCGCCGGA GGTGACGGGC TGCCATATCT CGGAGGTTCC GAATGGGATA ACCGGTCGGA CGACGACGCT GGATTTCCGG GCGTGCGTGG CGATTTTCGG GCATCTGGGG GTGGAACTGG ATCCGACGCA CCTGTCCGCC GAGGAGTCCA CGCGGCTCAA GGCGTGGATC GCGCTCCACA AGCGGCTGCG CGGCCTGCTC CATCACGGGC TGGCACAGTT CTGCACGGCG GACCCGGCAC GGGTCGTGCG CGGGGTCGTC TCCGACGATG CACGGTCGGG GGTCTTCCTG GTAGCCCAGC GCGACTGGGT TTCCGCCCGC AGGCCTTCGC CCATCAGGCT GAGCGGTCTC GACCCCGCGA AAACCTATCG CATCACACTC CCTGAGCCGC AAAACCTGCC AGGGTACCGG CCGTCCGAGG CGCAAAAAGC GGTATTTTCC GGAAAAGTCC CGGTCAGCGG AGCGACATTG ATGGATGTCG GAGTTTTCCC GCCTTTCATG CCGCCATTGT CCGCCCTGGT CGTGGAGTTG CAGGCCGTAT AA
|
Protein sequence | MTQAPITSPC IPEIETAGQG GGHPETLIRL DGTNATLLLI RRRAGLPEIA YWGRRLPQEI SEAEIFAVRA PDSPNNGPDQ WRPLITLLDT IGCWNFDMPG LIAARPDGSG WTANFETEQV AQEGRRLVVR GKDTVSGLAI VLSLELGPDD VLTSQARLTN TGTQSLRVER LVSGTYLLPE SVDTAHVLSG EWGNEFGIEP MNLTRGGIVV ESRRNRAHDH FPGMLLSPAS TSENEGEAWA AQLGWSGGHR LCVERMEDGR IRLSCGEYLY SGEGDLAPGA ELVTPVAYAT YSSAGFSGCA RAFQAHARRH LLHWPGGKMK PRPVLLNSWE GSGFDLHEDQ LMRQVDAAAA LGIERFVLDD GWFGARRDCD AGLGDWFSAA SVFPNGLRPL SDRIHGHGME FGLWYEPEMV NPDSDLYRAH PDWVLQTRGY PLWTSRNQLV LDISRPEVSN YLFEVISEQV ASVRIDYIKW DFNRDLVEAS DAQGRAAYRR QVLALYALWE RLHKAHPGLE IESCASGGGR ADWGALAHTQ RVWTSDDTDA LERLAIQGAA WHFLPPEVTG CHISEVPNGI TGRTTTLDFR ACVAIFGHLG VELDPTHLSA EESTRLKAWI ALHKRLRGLL HHGLAQFCTA DPARVVRGVV SDDARSGVFL VAQRDWVSAR RPSPIRLSGL DPAKTYRITL PEPQNLPGYR PSEAQKAVFS GKVPVSGATL MDVGVFPPFM PPLSALVVEL QAV
|
| |