Gene Information |
Locus tag | Aazo_4175 |
Symbol | |
ID | 9341980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 4247150 |
End bp | 4248589 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | YP_003722715 |
Protein GI | 298492538 |
COG category | |
COG ID | |
TIGRFAM ID | |
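The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4247150
end_bp = 4248589
protein_length_aa = 479

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1440
print(remainder)                # 0
print(expected_protein_length)  # 479
```

Under these assumptions the computed values agree with the listed Gene Length (1440 bp) and Protein Length (479 aa).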
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.776953 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCGCA ACCCCACCTT AACTTTTGAT CGTGGTACGT TAATTTTACA CCCACCACCA CACAGTAAAG CTTGGATAGA CTATGCAACA TGGGATGATA GAGTCGAAAA ATTCCGCATT CCGGCGATTA GATATCGTTG TTTAGTCGAA GCATTGCAAG CAGAAGATAC AAACTTCACA GATGAGGCCA AGGGATTTTA TCCTTTGGAT TTGGTTGCCA GTTTGGAAAT GACTCCATAC CCCCACCAAA ATGAGGCTTT AGCGGCTTGG AAATTAGCGG GAAGACAGGG AGTCGTAGTG TTGCCGACGG CTGCGGGTAA GACTTATCTG GCGCAAATGG CAATGCAAGC GACACCGCGC ACGACGTTGA TTGTTGTGCC AACTTTGGAT TTGATGCACC AGTGGTATGC ACACCTAACG GCGGCTTTTC CTGATGCGGA TTTGGGTTTG CTGGGGGGTG GTTCACGGGA TCAAACACCG ATTTTGGTGG CTACTTATGA CAGTGCAGCG ATTCATGCGG AAACTTTAGG TAATAAATAT GCTTTGATAA TTTTTGATGA ATGTCATCAT TTACCAACTG ATTTTAATCG AGTCATTGCT GAATATGCGA TCGCACCCTA TCGACTCGGA CTTTCTGCTA CACCAGAACG GACGGATGGT AAACACGCTG ATTTAAATAT TCTCATTGGT AGAGAAGTTT ATCGCCAAGG TGCTGAAGAT TTAGCTGGTA AGGCTTTAGC AGAACATCAA ATTGTGCAAA TTAAGGTCAA GTTATCCCAA TTGGAAAGGG AAAGATACAA TCAGCTAATT CAAACCCGCA ATGATTTTTT AAGGCAATCG CGGATTTCTT TGGGAAGTCT GCAAGGTTGG CAAACTTTTG TGCAAATGAG TGCGCGATCG CAAGTCGGAC GCAGAGCAAT GTTAGCACAC CGTCAAGCTA AAGAAATCGC CCTGGGAACT GATGGTAAAT TAAGAATTCT GATGGATTTA TTAGCTGAAC ATTATCCCGC TAGGGTGTTG ATTTTTACGG CAGATAATGC TACCGTTTAC CGTATTTCTC AAGATTTATT AATTCCGGCT ATTACTCATC AAACTCCGGT GAAGGAAAGG CATGAAATTT TAACTAAATT TAAGGAGGGT GAATATAATA CTTTGGTAGC TTCTCATGTC TTAAATGAGG GTGTTGATGT TCCCGCAGCT TCAATAGCAA TTATTCTTTC GGGGACTGGT TCGGCTAGGG AATATATTCA ACGTTTGGGG AGGGTTTTAC GCAAGGGTAA TATTGAAAAT AAACAGGCGA TTTTATATGA AGTCGTAGCA GAAGATACTA GTGAGGAGGG AACTTCGGCC AGGAGAAGGG GGGAAAGAAG TAACGAACCG CAAAGGTGCG AAGAGCGCGA AGAGAAGAAA GAGAAGAAGA AGGACAGGAA AGGGAATTGA
|
Protein sequence | MGRNPTLTFD RGTLILHPPP HSKAWIDYAT WDDRVEKFRI PAIRYRCLVE ALQAEDTNFT DEAKGFYPLD LVASLEMTPY PHQNEALAAW KLAGRQGVVV LPTAAGKTYL AQMAMQATPR TTLIVVPTLD LMHQWYAHLT AAFPDADLGL LGGGSRDQTP ILVATYDSAA IHAETLGNKY ALIIFDECHH LPTDFNRVIA EYAIAPYRLG LSATPERTDG KHADLNILIG REVYRQGAED LAGKALAEHQ IVQIKVKLSQ LERERYNQLI QTRNDFLRQS RISLGSLQGW QTFVQMSARS QVGRRAMLAH RQAKEIALGT DGKLRILMDL LAEHYPARVL IFTADNATVY RISQDLLIPA ITHQTPVKER HEILTKFKEG EYNTLVASHV LNEGVDVPAA SIAIILSGTG SAREYIQRLG RVLRKGNIEN KQAILYEVVA EDTSEEGTSA RRRGERSNEP QRCEEREEKK EKKKDRKGN
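Both sequences above are printed in space-separated blocks of ten characters, so the spaces need to be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction helper; the function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last two blocks of the gene sequence are used as a short stand-in for the full string.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a sequence printed in spaced blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two and last two blocks of the gene sequence above (illustration only).
head = clean_sequence("ATGGGTCGCA ACCCCACCTT")
tail = clean_sequence("AGGACAGGAA AGGGAATTGA")

print(len(head))               # 20
print(head.startswith("ATG"))  # True
print(tail.endswith("TGA"))    # True
```

Passing the complete cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field in the record.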