Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_43110 |
Symbol | |
ID | 7763184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4350465 |
End bp | 4353281 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643807166 |
Product | Fe-S/FAD domain protein |
Protein accession | YP_002801407 |
Protein GI | 226946334 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.326877 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCTGC CCGCTTCCTT TATTAGCGCC GTCGAGCGCC TGATTCCCCG CGAGCGGCGC TTCGACGATC CGCTGTCGAC CCTGGCCTTC GGCACCGACG CCAGTTTCTA CCGGCTGATT CCCAAGCTGG TCGTGCGCGT CGAGAGCGAA GCCGAAGTGG TCAAGCTGCT GCGCCTGGCC TCTGCCTCCC GCGTGCCGGT GACCTTCCGC GCCGCCGGCA CCAGCCTCTC CGGCCAGGCG GTGACCGACT CGGTGCTGAT CGTCCTCGGC GAGCACTGGA ACGGCCGCGA CATCCGCAAC GGCGGTGCGC AGATCCGCCT GCAGCCCGGC GTCATCGGCG CCCATGCCAA CGCCGTGCTG GCCCCGCTGG GACGCAAGAT CGGCCCGGAC CCGGCCTCGA TCAACGCGGC CAAGATCGGC GGCATCGTCG CCAACAACTC CAGCGGCATG TGCTGCGGCA CCGCGCAGAA CAGCTACCAC ACCCTCGCCG CTCTGCGCCT GGTGCTGGCC GACGGCAGCG TGCTGGACAC CGAGGACCCG CTCAGCGTGG CCCGCTTCCA GGCCAGCCAT GGCGAACTGC TCGAACGGCT CGCCGAACTG GGCCGGCAGA CCCGCGCCGA CGAAGCGCTG GCGGCGAAGA TCCGTCACAA GTACCGCCTG AAGAACACCA CCGGCCTGTC GCTCAACGCG CTGGTCGACT ACGACCAGCC GCTGGACATC CTCACCCACC TGATGGTCGG CTCCGAAGGC ACCCTGGGCT TCATCAGCGC GGTGACCTAC CACACCGTGC CCGAACACCC GCACAAGGCC AGCGCACTGA TCGTCTTCCC CGACGTGGAG ACCTGCTGCA ACGCCGTGCC GGTGCTCAAG CAGCAGCCGG TCTCGGCGGT GGAACTCCTG GACCGGCGCA GCCTGCGCTC GGTGGAGAAC ATGAAAGGCA TGCCGGACTG GGTGAAGAGC CTGTCCGCCA CCGCCTGCGC CCTGCTGATC GAGACCCGCG CCTCCGGCCC CTCGCTGCTG GCCGAGCAGA TCGAGCGGAT CATGGCCTCC ATCGCCGACT TCCCGGTGGA GAAACAGGTC GACTTCAGCA GCGACCCGGC CGTCTACAAC CAGTTGTGGA AGATCCGCAA GGACACTTTC CCGGCGGTCG GCGCGGTGCG CAGAACCGGC ACCACGGTGA TCATCGAGGA CGTCACCTTC CCCATCGAGC GCCTGGCCGA GGGCGTCAAC CGGCTGATCC AGCTCCTGGA GAAGCACCGC TACGACGAGG CGATCCTGTT CGGCCATGCC CTGGAAGGCA ACCTGCACTT CGTCTTCACC CAGGGCTTCG ACGAGCCGGC CCAGATCGCT CGCTACGAGG CCTTCATGCA GGAGGTGGCG CAACTGGTGG CGGTGGAGTT CGGCGGCTCG CTGAAGGCCG AGCACGGTAC CGGACGCAAC ATGGCGCCCT TCGTCGAGCT GGAATGGGGT CACGATGCCT ACCAGTTGAT GTGGAAGATC AAGCGCCTGC TCGACCCCAA GGGCATCCTC AACCCCGACG TGGTGCTCTC GGAAGATCCC GAGCTCCACC TGAAGAACCT CAAGCCATTG CCGGCCGCCG ACGAGATCGT CGACAAGTGC ATCGAGTGCG GCTTCTGCGA GCCGGTCTGC CCGTCCCGCG GCCTGACCCT GACGCCGCGC CAGCGCATCG TCATGTGGCG CGACATCCAG GCCAGGCGGC GCGACAACGC CGGCAGCACT GCTCTGGAAA AAGCCTACCG CTACCAGGGC ATCGACACCT GCGCGGCCAC CGGCCTGTGC GCCCAGCGCT GTCCGGTGGG GATCAACACC GGCGACCTGG TGCGCAAGCT GCGCGGCCTG GATGCCGGCC ATGCCGGAAC CGCCGACTGG CTGGCCGAGC ACTTCGCCGC CAGCGTGCGC GCCTCGCGCC TGGTGCTGCT CGCCGCCGAC AGCGCGCGGC GTCTGCTCGG TGCGCCGCGC CTGGAGAAGC TCAGCGGCGT TCTGTCGCGG ATCAGCGGCG GGCATCTGCC GCAGTGGACG CCGGCCATGC CACAGCCAGT GCGGCTGAAG CGGCCCGCCG CAGCGCAGCC CGACGAGCGA CCACGGGTGG TCTATCTGGC CGCCTGCGTA TCGCGCGCCA TGGGCCCGGC CGGCGACGAC CGTGAACAGG TACCGCTGCT CGACAAGACC CGCGCCCTGT TGGAAAAGGC CGGTTACCAG GTGGTCTTCC CGAAAGGCCT GGACAAGCTC TGCTGCGGCC AGCCGTTCGC TTCCAAGGGT TATCCCGAAC AGGCCGAGCG CAAGCGCCGG GAAACTCTCG ACGCCCTGCT CGAAGCCAGT CGCGGCGGTC TCGATCCGAT CTATTGCGAC ACCAGCCCCT GCACCCTGCG GCTGGTCAGG GAGCAGATCG ACCCGCGCCT GCAGGTCTTC GATCCGGTGA AGTTCATCCG CACCTTCCTG CTCGAACGGC TGGACTTCGA GCCGCAGGAA ACGCCGATCG CCGTGCATGT CACCTGCAGC ACCCAGCATC TGGGCGAGGC CGAGGCCTTG ATCGACATCG CCCGGCGCTG CGCCCGCGAA GTAGTGGTTC CGGAAGGCAT CCACTGCTGT GGCTTCGCCG GCGACAAGGG CTTCACCAGC CCCGAGCTGA ACGCCAACGC CCTGCGTACC CTCAAGGAGG CGGTGCAGTA TTGCGAGGAA GGCATTTCCA CCAGTCGCAC CTGCGAGATC GGTCTCAGCC GGCACAGTGG CCTTGACTAC CACGGGCTGG TCTACCTGGT CGATCGGGTC AGCCGCGCCA AGGCCCGTCC GGTCTGA
|
Protein sequence | MNLPASFISA VERLIPRERR FDDPLSTLAF GTDASFYRLI PKLVVRVESE AEVVKLLRLA SASRVPVTFR AAGTSLSGQA VTDSVLIVLG EHWNGRDIRN GGAQIRLQPG VIGAHANAVL APLGRKIGPD PASINAAKIG GIVANNSSGM CCGTAQNSYH TLAALRLVLA DGSVLDTEDP LSVARFQASH GELLERLAEL GRQTRADEAL AAKIRHKYRL KNTTGLSLNA LVDYDQPLDI LTHLMVGSEG TLGFISAVTY HTVPEHPHKA SALIVFPDVE TCCNAVPVLK QQPVSAVELL DRRSLRSVEN MKGMPDWVKS LSATACALLI ETRASGPSLL AEQIERIMAS IADFPVEKQV DFSSDPAVYN QLWKIRKDTF PAVGAVRRTG TTVIIEDVTF PIERLAEGVN RLIQLLEKHR YDEAILFGHA LEGNLHFVFT QGFDEPAQIA RYEAFMQEVA QLVAVEFGGS LKAEHGTGRN MAPFVELEWG HDAYQLMWKI KRLLDPKGIL NPDVVLSEDP ELHLKNLKPL PAADEIVDKC IECGFCEPVC PSRGLTLTPR QRIVMWRDIQ ARRRDNAGST ALEKAYRYQG IDTCAATGLC AQRCPVGINT GDLVRKLRGL DAGHAGTADW LAEHFAASVR ASRLVLLAAD SARRLLGAPR LEKLSGVLSR ISGGHLPQWT PAMPQPVRLK RPAAAQPDER PRVVYLAACV SRAMGPAGDD REQVPLLDKT RALLEKAGYQ VVFPKGLDKL CCGQPFASKG YPEQAERKRR ETLDALLEAS RGGLDPIYCD TSPCTLRLVR EQIDPRLQVF DPVKFIRTFL LERLDFEPQE TPIAVHVTCS TQHLGEAEAL IDIARRCARE VVVPEGIHCC GFAGDKGFTS PELNANALRT LKEAVQYCEE GISTSRTCEI GLSRHSGLDY HGLVYLVDRV SRAKARPV
|
| |