Gene Information |
Locus tag | Avin_37850 |
Symbol | |
ID | 7762677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3828702 |
End bp | 3830117 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643806651 |
Product | hypothetical protein |
Protein accession | YP_002800904 |
Protein GI | 226945831 |
COG category | [C] Energy production and conversion |
COG ID | [COG3488] Predicted thiol oxidoreductase |
TIGRFAM ID | |
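The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. Both assumptions fit the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3828702, 3830117)
print(length, protein_length_aa(length))  # prints: 1416 471
```

The result matches the Gene Length (1416 bp) and Protein Length (471 aa) fields.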
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0692567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCCGTCAC CGTTTCACCG CCTCACCCTG CTGCTGGCAC TCGGCCTCGG CGCCTGCGAC GGCCAGGCAC CGGACTTCGG CCGGGCCGAA CCCGACGAAG CCCTGTCGGC CGGCAGCGCC ACCGTCCGAC GCAGCGACCG GGACGCCTTC GCCCAGCCCG CGGCCAACCT GACGGCCTCG CAACGCATGG ACTTCGCCGT CGGCAACAGC TTCTTCCGCA AGCCCTGGGT GATCGCCCCG TCCAGCACCA CCGCCCGCGA CGGCCTCGGC CCGCTGTTCA ACACCAATGC CTGCCTGAAC TGCCACATCC GCGACGGCCG CGGCCACCCA CCGGAGGCGC GGGCGGACAA CGCCGTGTCC CTGCTGCTGC GCCTGTCGAT CCCGGAAACG CCGGAAACCG CCTCCATCGC CGCACGCCTC GGCGTGGTTC CCGAACCGCT CTACGGCACC CAACTGCAGG ACATGGCGGT GCCCGGCGTG ATCCCGGAGG CCCGGGTGCG CCTGAGCTAC GACACCCACA GCCTGCGTTT CGCCGACGGC TTCGCAGTGG AGCTGCGCCG GCCACGACTG CAGCTAGACC GGTTGGGCTA CGGTCCGCTG CATCCCGACA CACGCTTTTC CCTGCGCGTC GCCCCACCGA TGATCGGCCT CGGCCTGCTG GAGGCGATTC CCGACACAGC CCTTCTCGCC AATGCCGACC CCGACGACGC CGATGGCGAC GGCATCTCCG GCCGGCCCAA CCGGGTTCGC GACCACGCCA CCCTGAACAC CGCCCTCGGC CGTTTCGGCT GGAAGGCCGG ACAGCCGAAT CTCGGCCAGC AGAACGCCGA GGCCTTCCTC AACGATCTGG GACTTTCCAG CCGCCTGCGT CCCGGCAACA ACTGCACCCC GGCGCAGAGC GCCTGCCTGG CCGCTGCCGA TGGCGGGACG CCGGAGGTCG ACGACCACCT GCTCGCCCGC GTGCTGTTCT ACACCCGCCA CCTCGGCGTA CCGGCCCGGC GCGCCGTGGA CGACCCGCAG GTGCTGGCCG GCAAGGCGCT GTTCCATGGC GCCGGCTGCG CGCGGTGCCA CACCCCGACC TTCGTCACCG CCGCGGACGC CGCCGAAGCG ACGCTGGCCA GCCAGAAGAT CCACCCCTAC AGCGACCTTC TGCTGCATGA CATGGGCGAC GGTCTGGCCG ACAATCGTGC GGAATTCCAG GCCAGCGGTC GCGAATGGCG GACTCCGCCG CTATGGGGGC TCGGTTTGAC GCGACGGGTG AGCGGCCACA CCCAGCTCCT CCACGACGGC CGGGCACGCA ATCCGCTGGA AGCGATTCTC TGGCACGGCG GCGAGGCGCA AGCGGCGCGG GATCGGGTCC TGGCCTTCGA CGCAGGCCAG CGTGCGGCAC TCCTGGCCTT TCTGAACTCC CTCTGA
Protein sequence | MPSPFHRLTL LLALGLGACD GQAPDFGRAE PDEALSAGSA TVRRSDRDAF AQPAANLTAS QRMDFAVGNS FFRKPWVIAP SSTTARDGLG PLFNTNACLN CHIRDGRGHP PEARADNAVS LLLRLSIPET PETASIAARL GVVPEPLYGT QLQDMAVPGV IPEARVRLSY DTHSLRFADG FAVELRRPRL QLDRLGYGPL HPDTRFSLRV APPMIGLGLL EAIPDTALLA NADPDDADGD GISGRPNRVR DHATLNTALG RFGWKAGQPN LGQQNAEAFL NDLGLSSRLR PGNNCTPAQS ACLAAADGGT PEVDDHLLAR VLFYTRHLGV PARRAVDDPQ VLAGKALFHG AGCARCHTPT FVTAADAAEA TLASQKIHPY SDLLLHDMGD GLADNRAEFQ ASGREWRTPP LWGLGLTRRV SGHTQLLHDG RARNPLEAIL WHGGEAQAAR DRVLAFDAGQ RAALLAFLNS L
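Both sequences are displayed in space-separated blocks of ten letters, so the spaces need to be removed before the text can be used for length or composition checks. The following is a minimal sketch of that cleanup plus a GC-percentage calculation. The function names are hypothetical, and the example runs only on the first three blocks of the gene sequence, so its GC value will differ from the 71% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits a sequence into display blocks."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, copied from the record above.
fragment = "ATGCCGTCAC CGTTTCACCG CCTCACCCTG"
dna = clean_sequence(fragment)
print(len(dna), round(gc_percent(dna), 1))
```

Passing the complete Gene sequence field through the same two functions should give a length that can be compared with the Gene Length field.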