Gene Aazo_0696 details


Gene Information

Locus tag: Aazo_0696
Symbol:
ID: 9338482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 733119
End bp: 736142
Gene Length: 3024 bp
Protein Length: 1007 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: exonuclease SbcC
Protein accession: YP_003720284
Protein GI: 298490107
COG category:
COG ID:
TIGRFAM ID:
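
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions (which matches the reported gene length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(733119, 736142)
print(length)                  # 3024
print(protein_length(length))  # 1007
```

Both results agree with the Gene Length and Protein Length fields of this record.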


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCCCAG TTCAACTTAT CCTTAAAAAC TTCCTTAGTT ACCGTGATGC AACTTTAGAT 
TTTGGCGGTT TGCATACGGC TTGTATTTGT GGTTCTAATG GTGCAGGTAA ATCTTCCCTT
CTGGAAGCTA TCACTTGGTC TATTTGGGGT CAAAGCCGTG CCACTGTTGA AGATGATGTT
ATCTATTCTG GCGCAAAAGA AGTCAGAGTT GATTTTACTT TCTACAATAA CCAACAAACT
TATCGCGTAA TTCGTACTCG CGCACGGGGT GCCACTAGCA TCCTGGAATT TCAAATTGAA
ACTCCTGCCG GTTTTCGTCC CCTAACTCGC AAAGGGATGC GAGCAACGCA GGATGTGATT
ATACAACATA TCAAGCTCGA TTACGAAACC TTTATTAATT CTGCTTACTT ACGTCAAGGA
CGAGCAGATG AATTCATGCT CAAACGTCCT ACTGAACGGA AGGAAATTTT AGCGGAGTTG
TTAAAACTCG ATCAATATGA TGTATTGGAA GAAAGAGCTA AGGACAGTTC TAAACTTTAT
AAAGGAAGGG CGGAAGAGTT AGAGCGTTCT TTGGATAATA TCAAAGTTCA ACTCCAACAA
CAGGAAACAA CAAAAGCGCA AAGAGTGGAG TTAGAATCTC AACTTAATAG TCTTCAACAG
CAGCAAGCTC TTGATAATAT TCAATTGCAA AGTTTGCAAG TTGTCGAACA TAAACGCCAA
AACTGGGAAC AACAACTGAA TTTTGTCTGG CAACAATATC AAAATCTTAG CCAAGATTGT
GATCGCTTGC ATGAAGAACA ATTAGCTGTT AAATCCCAAT TAGCAGATTT AAAAGTCATT
TTAAATCAAG CTGCCGAAAT TATCGCCGGA TACGCTCAAT ATCAGAGTCT ACAATCCCAA
GAAGAGGCTT TTGCTGTTAA ATCTGAACAA CATACCCGCG CTACCAGCTT CCGACAACAA
CAACAACAAG AGCTTACTAA ACAAGTCCAA ACAATTGAAT ACCGATTTCA ACAAGCTCAA
GCTCAATTAG AAGGTTTAGA ACAACAAGAG CAAGAAATTC AACAAACTCT CACTAAATCT
TCGGAAGTAG AAACTGCTTT AGCTCAATTA GCTTCGGCTC GTAAGCATCT TAATTATTTC
GATCAGTTGC AAATGCAAGT GAATCCTTTA TTACAACAAC GGTTAAGTTT ACAGAATCAA
TTAGATCGCA CTCGTGCTAG TTTAGTAGCG CGGCTGGAAC AACTGCAAGC TACAGAAACC
CAACTCCAAA GTCAATATCG TCGTCAACCA CAACTACAAC AAGCGGCGCT AGATGTGGGT
ATACAAATTG AAGAACTGGA GAAAAAACGG GTGTATTTAC AGCGGGTGCA GGAAAAAGGA
CAGGAACGCA GGCACTTTAT CGAACGTTTA CAAGTACACC AACGAGATTA CGAGAAATTA
CTGGGAGAAC TAGAGCAGAA ATTACAAGTA CTCCAAAGTC CTAATGCTTT GTGTCCTTTG
TGTGAACGTC ATCTAGATGA GCATCACTGG AGTCGGGTTA TACAAAAAAC CCAACTAGAG
TATGAAGATA CCCAAGGACA ATTTTGGGTA GTGCGGGAAC AAATGGCGGT TTCTGACAGA
GAAATTCAGG TACTTAGACA AGAATATCGA GAAATTTCTC AGCAATTAGC TGGTTATGAT
GCTTTGCGTG AACAAAGGGG ACAATTAGCT GCAAAATTAC AAGCAACTAC AGATGTTCAA
GAGCAGTTAC AACAAATTGC TCTGGAAAGA GAACATTTAG AAAGTTCTTT GCAAGGAGAT
TATGCTCCTG ATAAACAAGT AGAACTCCAG CAATTAGAGC AATATCTGCA ACAGTTGAAT
TATAATGAAC AAGACCATAC TTTAGCTAGA AGTGAAGTAG AGCGTTGGCG ATGGGCAGAA
ATTAAACAAG CACAAATTAA AGATGCTACT AAAAAACAGG CTCAATTAGC AGCCAGAAAA
CCAAAATTAC AAGCTACTAT TGACGAATTT AAGCTCAAAA TTCAGTTAGA ACAAACAGAT
TCTGATACAG CTAAACAAAT AGAAGCTTTA ACTCAGGAAA TTAAAGAGCT TAACTACAGT
TCTGAACAAC ACAATAAGTT GCGTCAAGCT GTACGTGAGT CACAATCTTG GCAGTTGCGT
TATCAACAGT TTTTGTCGGC TCAACAAAAG TATCCTCAAC TTGAGACAAG ATTAGAAGAT
TTGGCAAGTT CTTACAAGAG TAGATTAGCA GATCAGCAAA GATTTGCTAC TCAAATTGAC
AGCATTGTAG AGCAATTAAA AGCTACAGCG AACCCGACGG AGCAAATTAA TGCTTTAGAA
CAGCAAATAG CGATTCGCAG AAGGGAACTT GACGAGAAAA TAGCTAATTT GGGGCGTGTA
GAACAACTAT TACATCAATT ACAAACGTTG CAGACTCAGT ATGTGCAAGA ACAGGAACAA
TTAAAATATT GTCAGCAGCA ACATCGTGTT TATCACGAAT TAACGCAAGC TTTTGGTAAA
AATGGTATCC AAGCGTTGAT GATTGAAAAT GTGTTACCAC AACTAGAAGC TGAGACAAAT
CAACTACTTT CACGGTTGAG TGCTAATCAA CTACACGTAC AATTCGTTAC TGTGAAAGCG
GGACGTAGTG GAAAATCAAC TAGGAAACAT ACTAAGTTGA TCGATACTTT AGATATCTTA
ATTGCTGATG GGAGGGGAAC GCGAGCCTAT GAAACTTATT CTGGGGGCGA AGCGTTTAGA
ATTAATTTTG CGATTCGCTT GGCCTTAGCG AAATTATTAG CTCAACGTGC GGGAGCAGCC
TTACAACTGT TAATTGTAGA TGAAGGCTTT GGGACTCAGG ATAATGAAGG GTGTGATCGC
TTGATTGCGG CGATTAATGC GATCGCTAGT GATTTCGCCT GTATACTTAC AGTAACTCAT
ATTCCCCACC TCAAAGAAGC CTTCCAAGCG CGGATAGAGG TTAACAAAAC TCAACAAGGT
TCACATATAT ATCTATCAAT TTAA
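
The gene sequence above is displayed in space-separated blocks of ten bases, so it has to be joined before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first displayed line of the sequence is used as sample input.

```python
def join_blocks(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from this record.
first_line = "ATGATCCCAG TTCAACTTAT CCTTAAAAAC TTCCTTAGTT ACCGTGATGC AACTTTAGAT"
seq = join_blocks(first_line)
print(len(seq))   # 60
print(seq[:3])    # ATG
```

Passing the full joined sequence to gc_fraction gives a value that can be compared with the GC content reported in the Gene Information section.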
 
Protein sequence
MIPVQLILKN FLSYRDATLD FGGLHTACIC GSNGAGKSSL LEAITWSIWG QSRATVEDDV 
IYSGAKEVRV DFTFYNNQQT YRVIRTRARG ATSILEFQIE TPAGFRPLTR KGMRATQDVI
IQHIKLDYET FINSAYLRQG RADEFMLKRP TERKEILAEL LKLDQYDVLE ERAKDSSKLY
KGRAEELERS LDNIKVQLQQ QETTKAQRVE LESQLNSLQQ QQALDNIQLQ SLQVVEHKRQ
NWEQQLNFVW QQYQNLSQDC DRLHEEQLAV KSQLADLKVI LNQAAEIIAG YAQYQSLQSQ
EEAFAVKSEQ HTRATSFRQQ QQQELTKQVQ TIEYRFQQAQ AQLEGLEQQE QEIQQTLTKS
SEVETALAQL ASARKHLNYF DQLQMQVNPL LQQRLSLQNQ LDRTRASLVA RLEQLQATET
QLQSQYRRQP QLQQAALDVG IQIEELEKKR VYLQRVQEKG QERRHFIERL QVHQRDYEKL
LGELEQKLQV LQSPNALCPL CERHLDEHHW SRVIQKTQLE YEDTQGQFWV VREQMAVSDR
EIQVLRQEYR EISQQLAGYD ALREQRGQLA AKLQATTDVQ EQLQQIALER EHLESSLQGD
YAPDKQVELQ QLEQYLQQLN YNEQDHTLAR SEVERWRWAE IKQAQIKDAT KKQAQLAARK
PKLQATIDEF KLKIQLEQTD SDTAKQIEAL TQEIKELNYS SEQHNKLRQA VRESQSWQLR
YQQFLSAQQK YPQLETRLED LASSYKSRLA DQQRFATQID SIVEQLKATA NPTEQINALE
QQIAIRRREL DEKIANLGRV EQLLHQLQTL QTQYVQEQEQ LKYCQQQHRV YHELTQAFGK
NGIQALMIEN VLPQLEAETN QLLSRLSANQ LHVQFVTVKA GRSGKSTRKH TKLIDTLDIL
IADGRGTRAY ETYSGGEAFR INFAIRLALA KLLAQRAGAA LQLLIVDEGF GTQDNEGCDR
LIAAINAIAS DFACILTVTH IPHLKEAFQA RIEVNKTQQG SHIYLSI