Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_2082 |
Symbol | |
ID | 9339876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 2165324 |
End bp | 2166472 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | |
Product | DNA polymerase IF1 subunit beta |
Protein accession | YP_003721251 |
Protein GI | 298491074 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0161872 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTAG TTTGCTCTCA AAGCGACCTC AGTACCAATC TTTCACTCGT CAGTCGTGCA GTACCATCAA GACCTACTCA TCCCGTACTT GCTAACGTAC TTCTACAGGC GGATGCAGAA ACTAACCAGG TCAGCTTAAC AGCCTTTGAT CTCAGCTTGG GTATCCGTAG CACTTTTAGT GCTGAGGTAA TTGAAGGCGG TGCGATCGCT CTTCCTGCTA AGCTACTTGT AGATATTACC TCCCGTCTAC CAGAAGGCGA AATCACCCTA GATGACGAAT CAGGAGACAA CACCGGAGAA GGTATACTTG TCACCCTCAA ACCCAAAAGC GGCAAGTATC AAGTCCGGGC AATGGGAGCA GAAGAATTCC CCGAATTACC TGTAATTGAA AGCTCCGAAG CAATTCAACT CACTACGGCT GCAGTAATTG AAGGCTTGAA GGGTTCATTA TTTGCTACCA GTGCAGATGA AACCAAACAA GTCCTCACAG GCGTGCATTT AACGGTTAAA CAGGACACCT TAGAATTTGC CGCTACAGAT GGACACCGCC TTGCAGTCCT GGAAACTACT AACGAGCGTC CTATTGATAG TAATGAACAA CTAGAAGTGA CAGTACCAGC TAGAGCCTTG CGCGAATTAC AGCGGATGTT AGCCCATAGT TCATTAGAAG AAACAGTAGC CTTATATCTT GATCCTGGTC AAGTCGTATT CTCTTGGCAA AATCAACGCT TAACCAGTCG GACTTTAGAA GGACAATATC CCGCTTATCG GCAATTAATA CCGCGTCAAT TTGAGCGACA AGTCACACTA GAACGGCGGC AATTTATCAG CACCTTAGAG CGAATTGCAG TGTTAGCTGA TCAGAAAAAT AATATTGTCA AAGTCAGTAT TGATAATGCC AATCAAGAAA TTACTTTATC TTGTGAAGCG CAAGATGTGG GTAGTGGTAC AGAGTCAATG CCGGCAAAGA TTTCTGGGGA AGATATAGAC ATTGCTTTTA ACGTCAAATA TTTAATGGAA GGCGTGAAAG AGTTACCATC TTCCGAAATT CAAATGCATT TAAATCAAAG TTTAACTCCG GTAGTTTTTA CACCTTTAGG CGGTTTGAAA ATGACCTATT TAGCTATGCC TGTGCAACTT AGGAATTAG
|
Protein sequence | MKLVCSQSDL STNLSLVSRA VPSRPTHPVL ANVLLQADAE TNQVSLTAFD LSLGIRSTFS AEVIEGGAIA LPAKLLVDIT SRLPEGEITL DDESGDNTGE GILVTLKPKS GKYQVRAMGA EEFPELPVIE SSEAIQLTTA AVIEGLKGSL FATSADETKQ VLTGVHLTVK QDTLEFAATD GHRLAVLETT NERPIDSNEQ LEVTVPARAL RELQRMLAHS SLEETVALYL DPGQVVFSWQ NQRLTSRTLE GQYPAYRQLI PRQFERQVTL ERRQFISTLE RIAVLADQKN NIVKVSIDNA NQEITLSCEA QDVGSGTESM PAKISGEDID IAFNVKYLME GVKELPSSEI QMHLNQSLTP VVFTPLGGLK MTYLAMPVQL RN
|
| |