Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_42040 |
Symbol | |
ID | 7763082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4232559 |
End bp | 4233719 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643807057 |
Product | hypothetical protein |
Protein accession | YP_002801306 |
Protein GI | 226946233 |
COG category | [S] Function unknown |
COG ID | [COG3177] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGACT CACGCTGGAT CTGGCAGCAA CCGGATTGGC CGCATTTCCG CTGGCAGGCC GAGCGCCTGG CGCCGCTGCT GCGCGACTGC GCGCAGGCCC AGGGCCGACT GCTCGGTATG GCCGGCGCGG CAGGGGACGA ACTCGGCGCG CAGAGCGAGC TGGACGCCCT GCTGCAGAAC ATCGTCACCT CCTCGGCCAT CGAAGGCGAG CGGCTCGACG TCGGCTCGGT GCGCTCCTCG CTGGCCCGCC GCCTGGGCCT CGAAACGGAC GGCGCCAGCC GCGTCAGCCC GCGCAGCGAA GGTCTCGCCG AGCTGATGCT GGACGCCACC CGGCACCTGG AGCAACCGCT GAGCCTCGAA CACCTGCTGC ACTGGCACCG CCTGCTGTTT CCCGCGCAGG ACGACGACCT GCTGCCACGG CGCATCCGCG TCGGCGCCCT GCGCGGCGAA GAGCCTATGC AGGTGGTCTC CGGCCGGCTC GACCGGCTCA CCGTGCATTT CGAGGCGCCG CCGCGCGACG GGCTGGAACG GCAACTGGCC GCCTTCCTCG ACTGGTTCGA GGCGAGCCGC CGCGACGGCG GCCTCGACCC GCTGCTGCGC GCCGGCATCG CCCATTTCTG GTTCGTCACC CTGCATCCCT TCGACGACGG CAACGGCCGG CTGACCCGCG CCATCACCGA CCTGGCGCTG GCCCAGGGCG AGCGCCAGGC GATCCGCCTG CATGCCATGT CGGCCAGTAT CCTCGACGAC CGCCAGGGCT ACTACCGCAT CCTCGAAGCC AACCAGAAGG GTGGCCCGGA GATCACCCCC TGGCTGGAAT GGTTTCTCCA TACCCTGCTG CGCAGCCTGC AACAGGCGCT GGCGCGCATC GACCGGGTGC TGGCCAAGTC GCGCTTCTGG CAGCGGCACC GCCATCAGGC GCTGTCCGCC GAACAGATCA AGGTGCTCGA TCGCCTGCTC GACGGCGGCG AACGCGGCTT CGAAGGCGGC ATCAGCGCCG CCCAGTACCA GGCCGTCGCC AGGGTTTCCA AGGCCACCGC CACCCGCCAC CTCGGTGACC TGCTCGACAA GGGCTGGCTC GTCCGCCTGC CCGGCGGCGG GCGCAGCACC CGTTACCGGA TCGACTGGCC CACCACCGGC GGGCCCGATA CATCCCTGTA G
|
Protein sequence | MEDSRWIWQQ PDWPHFRWQA ERLAPLLRDC AQAQGRLLGM AGAAGDELGA QSELDALLQN IVTSSAIEGE RLDVGSVRSS LARRLGLETD GASRVSPRSE GLAELMLDAT RHLEQPLSLE HLLHWHRLLF PAQDDDLLPR RIRVGALRGE EPMQVVSGRL DRLTVHFEAP PRDGLERQLA AFLDWFEASR RDGGLDPLLR AGIAHFWFVT LHPFDDGNGR LTRAITDLAL AQGERQAIRL HAMSASILDD RQGYYRILEA NQKGGPEITP WLEWFLHTLL RSLQQALARI DRVLAKSRFW QRHRHQALSA EQIKVLDRLL DGGERGFEGG ISAAQYQAVA RVSKATATRH LGDLLDKGWL VRLPGGGRST RYRIDWPTTG GPDTSL
|
| |