Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_35020 |
Symbol | |
ID | 7762397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3575956 |
End bp | 3577254 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643806368 |
Product | hemolysin-like protein |
Protein accession | YP_002800626 |
Protein GI | 226945553 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.518958 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGCGG GCCTGCTCCT GCTGCTGATC GCGCTCAACG GCGTGTTCGC GATGGCGGAG ATCGCCGTGG TTTCCTCGCG CAAGGCCCGC CTGCAGAGGC TGGCCGACGA GCGGCGGCAC GGCGCGCTGG TCGCCCTGGC GCTACAAGAT GATCCTTCGC GCTTTCTATC CACCATCCAG GTCGGCATCA CTACCGTGGG CGTTCTCAGC GGCGCACTCG GCGAAACCCT GCTGGCCCGG CCGCTGACGC AGCGGATCGC CGAACTGCCG GCGCTAGCGC CCTATGCCGA GGCCATCGCC CTGATCCTCA CCGTGGCGGC GATCACCTAT CTGTCCGTGG TGGTCGGCGA ACTGCTGCCC AAACGCCTGG CCCTGCTCGC GCCGGAAACC ATCGCCAGCG CGCTCGCACC GGCGATGCGA CGCCTGGCGC AGATTGCCGC GCCGCTGGTA TGGCTGCTGT CGTATTCCTG CAATCTGTTG CTGCGCCTGA TCGGCATCGG ACGTCGCGAC GAACCGCCGA TCACCGACGA GGAGATCCAG GTGCTGATGG AGCAGGGGGC CGAGGCCGGG GTTTTCCACG AGAGCGAGCA AGCGTTCGTG GCCAATGTCC TGCACCTGGA CGAGCAGCCG GTCGGCGCGA TCATGTCGCC CCGCCAGCAG GTCTACGCCA TCGATCTGGA CGATCCGCGG GAGGAGCAGC TCAGGCGGCT CGCCGAGAGC CCCTATACCC GTGTCCTGGT ATGCCGGGGC GGCCTGCATC GGCTGGTCGG TGTGCTGCAT CGCGGCGATC TGCTCAAGCC GGCCCTGCAA GGGCAGCCGA TCGATCTGCA GCGGAGCGCC CGGCCGCCGT TCTATGTGGA GGAAAGCGCG AGCAGTACCG GCCTGCTGGA GGATTTCCGC AGGACGCGCA ACGAGTTCGC GGTGGTCGTC GACGAATTCG ACGACCTGCA AGGCATCGTC ACCCTCAGGG ACGTGCTCAC CGCGATCGTC GGCGAGATTC CCGATGCACT GCACGACGGC GAGCCGGCGA TCGTGCGCCG CGAGGACGGT TCCTGGCTGG TCGACGGTGG CATGGGTATC GAGCAGTTGA AAACCGCGCT GGACATCGAC GCGGCGTTTC CGGGCGAGGC GGACAACGCC TATCGCACGC TCGCCGGTCT GGTCATGCAT TGCCTGCAGC GCGTGCCGAG CGTTTCCGAT CACTTCGAAC TCGACGGCTG GCGCTTCGAG GTGGTCGACA TGGACCGGAC GCGCATCGAC AAGGTGCTGG TGATGCGCCC GAGCGCGCCG CCGGCCTGA
|
Protein sequence | MEAGLLLLLI ALNGVFAMAE IAVVSSRKAR LQRLADERRH GALVALALQD DPSRFLSTIQ VGITTVGVLS GALGETLLAR PLTQRIAELP ALAPYAEAIA LILTVAAITY LSVVVGELLP KRLALLAPET IASALAPAMR RLAQIAAPLV WLLSYSCNLL LRLIGIGRRD EPPITDEEIQ VLMEQGAEAG VFHESEQAFV ANVLHLDEQP VGAIMSPRQQ VYAIDLDDPR EEQLRRLAES PYTRVLVCRG GLHRLVGVLH RGDLLKPALQ GQPIDLQRSA RPPFYVEESA SSTGLLEDFR RTRNEFAVVV DEFDDLQGIV TLRDVLTAIV GEIPDALHDG EPAIVRREDG SWLVDGGMGI EQLKTALDID AAFPGEADNA YRTLAGLVMH CLQRVPSVSD HFELDGWRFE VVDMDRTRID KVLVMRPSAP PA
|
| |