Gene Information |
Locus tag | Aazo_0696 |
Symbol | |
ID | 9338482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 733119 |
End bp | 736142 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | |
Product | exonuclease SbcC |
Protein accession | YP_003720284 |
Protein GI | 298490107 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCCAG TTCAACTTAT CCTTAAAAAC TTCCTTAGTT ACCGTGATGC AACTTTAGAT TTTGGCGGTT TGCATACGGC TTGTATTTGT GGTTCTAATG GTGCAGGTAA ATCTTCCCTT CTGGAAGCTA TCACTTGGTC TATTTGGGGT CAAAGCCGTG CCACTGTTGA AGATGATGTT ATCTATTCTG GCGCAAAAGA AGTCAGAGTT GATTTTACTT TCTACAATAA CCAACAAACT TATCGCGTAA TTCGTACTCG CGCACGGGGT GCCACTAGCA TCCTGGAATT TCAAATTGAA ACTCCTGCCG GTTTTCGTCC CCTAACTCGC AAAGGGATGC GAGCAACGCA GGATGTGATT ATACAACATA TCAAGCTCGA TTACGAAACC TTTATTAATT CTGCTTACTT ACGTCAAGGA CGAGCAGATG AATTCATGCT CAAACGTCCT ACTGAACGGA AGGAAATTTT AGCGGAGTTG TTAAAACTCG ATCAATATGA TGTATTGGAA GAAAGAGCTA AGGACAGTTC TAAACTTTAT AAAGGAAGGG CGGAAGAGTT AGAGCGTTCT TTGGATAATA TCAAAGTTCA ACTCCAACAA CAGGAAACAA CAAAAGCGCA AAGAGTGGAG TTAGAATCTC AACTTAATAG TCTTCAACAG CAGCAAGCTC TTGATAATAT TCAATTGCAA AGTTTGCAAG TTGTCGAACA TAAACGCCAA AACTGGGAAC AACAACTGAA TTTTGTCTGG CAACAATATC AAAATCTTAG CCAAGATTGT GATCGCTTGC ATGAAGAACA ATTAGCTGTT AAATCCCAAT TAGCAGATTT AAAAGTCATT TTAAATCAAG CTGCCGAAAT TATCGCCGGA TACGCTCAAT ATCAGAGTCT ACAATCCCAA GAAGAGGCTT TTGCTGTTAA ATCTGAACAA CATACCCGCG CTACCAGCTT CCGACAACAA CAACAACAAG AGCTTACTAA ACAAGTCCAA ACAATTGAAT ACCGATTTCA ACAAGCTCAA GCTCAATTAG AAGGTTTAGA ACAACAAGAG CAAGAAATTC AACAAACTCT CACTAAATCT TCGGAAGTAG AAACTGCTTT AGCTCAATTA GCTTCGGCTC GTAAGCATCT TAATTATTTC GATCAGTTGC AAATGCAAGT GAATCCTTTA TTACAACAAC GGTTAAGTTT ACAGAATCAA TTAGATCGCA CTCGTGCTAG TTTAGTAGCG CGGCTGGAAC AACTGCAAGC TACAGAAACC CAACTCCAAA GTCAATATCG TCGTCAACCA CAACTACAAC AAGCGGCGCT AGATGTGGGT ATACAAATTG AAGAACTGGA GAAAAAACGG GTGTATTTAC AGCGGGTGCA GGAAAAAGGA CAGGAACGCA GGCACTTTAT CGAACGTTTA CAAGTACACC AACGAGATTA CGAGAAATTA CTGGGAGAAC TAGAGCAGAA ATTACAAGTA CTCCAAAGTC CTAATGCTTT GTGTCCTTTG TGTGAACGTC ATCTAGATGA GCATCACTGG AGTCGGGTTA TACAAAAAAC CCAACTAGAG TATGAAGATA CCCAAGGACA ATTTTGGGTA GTGCGGGAAC AAATGGCGGT TTCTGACAGA GAAATTCAGG TACTTAGACA AGAATATCGA GAAATTTCTC AGCAATTAGC TGGTTATGAT GCTTTGCGTG AACAAAGGGG ACAATTAGCT GCAAAATTAC AAGCAACTAC AGATGTTCAA GAGCAGTTAC AACAAATTGC TCTGGAAAGA GAACATTTAG AAAGTTCTTT GCAAGGAGAT 
TATGCTCCTG ATAAACAAGT AGAACTCCAG CAATTAGAGC AATATCTGCA ACAGTTGAAT TATAATGAAC AAGACCATAC TTTAGCTAGA AGTGAAGTAG AGCGTTGGCG ATGGGCAGAA ATTAAACAAG CACAAATTAA AGATGCTACT AAAAAACAGG CTCAATTAGC AGCCAGAAAA CCAAAATTAC AAGCTACTAT TGACGAATTT AAGCTCAAAA TTCAGTTAGA ACAAACAGAT TCTGATACAG CTAAACAAAT AGAAGCTTTA ACTCAGGAAA TTAAAGAGCT TAACTACAGT TCTGAACAAC ACAATAAGTT GCGTCAAGCT GTACGTGAGT CACAATCTTG GCAGTTGCGT TATCAACAGT TTTTGTCGGC TCAACAAAAG TATCCTCAAC TTGAGACAAG ATTAGAAGAT TTGGCAAGTT CTTACAAGAG TAGATTAGCA GATCAGCAAA GATTTGCTAC TCAAATTGAC AGCATTGTAG AGCAATTAAA AGCTACAGCG AACCCGACGG AGCAAATTAA TGCTTTAGAA CAGCAAATAG CGATTCGCAG AAGGGAACTT GACGAGAAAA TAGCTAATTT GGGGCGTGTA GAACAACTAT TACATCAATT ACAAACGTTG CAGACTCAGT ATGTGCAAGA ACAGGAACAA TTAAAATATT GTCAGCAGCA ACATCGTGTT TATCACGAAT TAACGCAAGC TTTTGGTAAA AATGGTATCC AAGCGTTGAT GATTGAAAAT GTGTTACCAC AACTAGAAGC TGAGACAAAT CAACTACTTT CACGGTTGAG TGCTAATCAA CTACACGTAC AATTCGTTAC TGTGAAAGCG GGACGTAGTG GAAAATCAAC TAGGAAACAT ACTAAGTTGA TCGATACTTT AGATATCTTA ATTGCTGATG GGAGGGGAAC GCGAGCCTAT GAAACTTATT CTGGGGGCGA AGCGTTTAGA ATTAATTTTG CGATTCGCTT GGCCTTAGCG AAATTATTAG CTCAACGTGC GGGAGCAGCC TTACAACTGT TAATTGTAGA TGAAGGCTTT GGGACTCAGG ATAATGAAGG GTGTGATCGC TTGATTGCGG CGATTAATGC GATCGCTAGT GATTTCGCCT GTATACTTAC AGTAACTCAT ATTCCCCACC TCAAAGAAGC CTTCCAAGCG CGGATAGAGG TTAACAAAAC TCAACAAGGT TCACATATAT ATCTATCAAT TTAA
|
Protein sequence | MIPVQLILKN FLSYRDATLD FGGLHTACIC GSNGAGKSSL LEAITWSIWG QSRATVEDDV IYSGAKEVRV DFTFYNNQQT YRVIRTRARG ATSILEFQIE TPAGFRPLTR KGMRATQDVI IQHIKLDYET FINSAYLRQG RADEFMLKRP TERKEILAEL LKLDQYDVLE ERAKDSSKLY KGRAEELERS LDNIKVQLQQ QETTKAQRVE LESQLNSLQQ QQALDNIQLQ SLQVVEHKRQ NWEQQLNFVW QQYQNLSQDC DRLHEEQLAV KSQLADLKVI LNQAAEIIAG YAQYQSLQSQ EEAFAVKSEQ HTRATSFRQQ QQQELTKQVQ TIEYRFQQAQ AQLEGLEQQE QEIQQTLTKS SEVETALAQL ASARKHLNYF DQLQMQVNPL LQQRLSLQNQ LDRTRASLVA RLEQLQATET QLQSQYRRQP QLQQAALDVG IQIEELEKKR VYLQRVQEKG QERRHFIERL QVHQRDYEKL LGELEQKLQV LQSPNALCPL CERHLDEHHW SRVIQKTQLE YEDTQGQFWV VREQMAVSDR EIQVLRQEYR EISQQLAGYD ALREQRGQLA AKLQATTDVQ EQLQQIALER EHLESSLQGD YAPDKQVELQ QLEQYLQQLN YNEQDHTLAR SEVERWRWAE IKQAQIKDAT KKQAQLAARK PKLQATIDEF KLKIQLEQTD SDTAKQIEAL TQEIKELNYS SEQHNKLRQA VRESQSWQLR YQQFLSAQQK YPQLETRLED LASSYKSRLA DQQRFATQID SIVEQLKATA NPTEQINALE QQIAIRRREL DEKIANLGRV EQLLHQLQTL QTQYVQEQEQ LKYCQQQHRV YHELTQAFGK NGIQALMIEN VLPQLEAETN QLLSRLSANQ LHVQFVTVKA GRSGKSTRKH TKLIDTLDIL IADGRGTRAY ETYSGGEAFR INFAIRLALA KLLAQRAGAA LQLLIVDEGF GTQDNEGCDR LIAAINAIAS DFACILTVTH IPHLKEAFQA RIEVNKTQQG SHIYLSI
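The coordinate and length fields in the Gene Information table are related by simple arithmetic, which can serve as a quick sanity check on the record. The sketch below is a minimal illustration, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 733119
END_BP = 736142
GENE_LENGTH_BP = 3024
PROTEIN_LENGTH_AA = 1007


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def strip_grouping(sequence: str) -> str:
    """Remove the whitespace that splits the displayed sequence into 10-base groups."""
    return "".join(sequence.split())


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    # First three groups of the gene sequence as displayed in the record.
    print(len(strip_grouping("ATGATCCCAG TTCAACTTAT CCTTAAAAAC")))
```

Under these assumptions both checks agree with the record: the coordinate span matches the stated gene length, and the gene length matches the stated protein length plus one stop codon.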