Gene Aazo_3440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3440 
Symbol 
ID9341244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3507647 
End bp3509881 
Gene Length2235 bp 
Protein Length744 aa 
Translation table11 
GC content42% 
IMG OID 
Productfamily 57 glycoside hydrolase 
Protein accessionYP_003722198 
Protein GI298492021 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCACC CTCTATACGT CGCTTTTATC TGGCATCAAC ATCAGCCGTT ATATAAATCT 
CCTCGTAGTG GTGTTTCTAC ACCTGCTAGT CAGCAGTACC GCTTGCCTTG GGTTAGGTTA
CATGGTACTA AGGATTATTT GGATTTAATC TTGCTGCTAG AGCAGTATCC CAAGTTACAT
CAAACGGTGA ATTTAGTACC ATCCTTGATA CTGCAACTAG AAGATTATAT TGCTGGTACT
GCGTTTGACC CTTATCTGAC TGCCAGCTTA ACACCTGTTG AAAAGTTAAC TCAGGAACAG
AAAGAATTTA TTGTTGAACA TTTTTTTGAT GCTAATCACC ATACTTTAAT AGATCCCCAT
CCCCGCTATG CCCAGTTGTA CTATCAGAGA CAGGAAAAGG GACAAGCGTG GTGTTTAGCA
AATTGGCAGC CACTAGATTA CAGTGACTTA TTAGCTTGGC ATAATCTAGC TTGGATTGAT
CCTCTGTTTT GGGATGACCC AGAAATTGAA GCTTGGTTAA AACAGGGACA AAATTTTACT
TTAAGTGATC GCCAACGCAT TTATTCTAAA CAACGTCAAA TCCTCAGCCG CATTATTCCC
CAACACAGGA AAATGCAAGA AACTGGACAG TTAGAAGTCA CCACCACCCC CTATACTCAC
CCAATTTTGC CCTTGTTAGC TGATACCAAC TCCGGGCAGG TAGCAGTGCC AAACATGACA
TTACCTAACA ATCATTTTCA GTGGGCGGAA GATATTCCTC GTCATTTACA GAAATCTTGG
GATTTATATA AAGACAGATT TGGACAAGAA CCACGGGGTT TATGGCCTTC TGAACAGTCA
GTTAGTCCAG AAATATTACC GTATATTATT AAACAGGGCT TTAATTGGAT TTGCTCAGAT
GAAGCCGTCT TAGGTTGGAC CTTAAAACAC TTTTTCCATC GAGATGGGGC AGGAAATGTC
CAGCAACCAG AATTACTGTA CCGTCCTTAT CTTTTGCAAA CTCCAGCAGG TGATTTATCC
ATAGTTTTCC GTGACCATAG GTTGTCAGAT TTAATTGGTT TCACATACAG TTCCATGCAG
CTAAAACAGG CCGTAGCGGA TTTAGTGGGA CATTTGCAAG TGATCGCTAA AATGCAAAGA
GAGAAACCCA GCGAACAACC TTGGTTAGTA ACCATCGCCT TAGATGGTGA AAATTGCTGG
GAATTTTATC CCCAAGACGG CAAACCATTC CTAGAAACCT TATATCAAAG CTTGAGTAAT
GAACCTCATA TCAAACTGGT TACCGTCTCG GAATTCCTAG ACAAATATCC CGCCACAGCC
ACTATCCCCG GAGAACAACT CCATAGTGGT TCTTGGGTAG ATGGCAGTTT TACCACCTGG
ATAGGAGATC CCGCCAAAAA TCGCGCTTGG GACTACCTGA CCCAAGCCAG ACAAGTATTA
GCCAATCATC CCGAAGCTAC CGAAGACAAC AACCCTGCAG CCTGGGAAGC CTTATATGCA
GCCGAAGGTT CCGATTGGTT TTGGTGGTTT GGAGAAGGAC ATTCTTCAAA TCAAGATGCC
ATGTTTGACC AATTATTTCG TGAACATCTC TATGGAATTT ATAAAGCCCT CAATGAACCA
ATACCAGTCT ATTTAACAAA ACCAGTAGAA GTCCATGAAA CACGAGCAGA CCGTCGGCCA
GAAGCCTTTA TTCACCCAGT TATTGACGGT AAAGGTGATG AACAAGACTG GGACAAAGCC
GGCAGAATAG AAGTTGGTGG TGCAAGGGGA ACAATGCACA ACAGCAGCCT CATTCAACGA
CTTTGGTATG GAGTAGATCA CCTGAATTTT TACTTACGGG TAGATTTTAA AAGTGGCATT
GCACCAGGAA AAGAACTACC AACAGAGTTA AATTTACTTT GGTATTATCC AGATAGAACA
ATGGTTAATA GCCCTGTAAC TTTAGCAGAA GTTCCAGATA TATCACCAGT TAATTATCTG
TTTCACCATC ATCTAGAAAT TAATTTACTC ACACAATCAA TTCAATTTCG AGAAGCAGGA
AATAACTATC AATGGTATCC CCGCGTTAGT CGCGCCCAAG CTGCTTTAAA TACTTGTTTA
GAAGTGGCAA TACCTTGGGC AGATTTGCAA GTTCCCCCAG ATTATCCCCT GCGTCTGATT
TTGGTACTAG CCGATGATGG GTGTTTCCAT AGCTATTTAC CAGAAAATGC TTTAATTCCT
ATTGAAGTAC CTTAG
 
Protein sequence
MSHPLYVAFI WHQHQPLYKS PRSGVSTPAS QQYRLPWVRL HGTKDYLDLI LLLEQYPKLH 
QTVNLVPSLI LQLEDYIAGT AFDPYLTASL TPVEKLTQEQ KEFIVEHFFD ANHHTLIDPH
PRYAQLYYQR QEKGQAWCLA NWQPLDYSDL LAWHNLAWID PLFWDDPEIE AWLKQGQNFT
LSDRQRIYSK QRQILSRIIP QHRKMQETGQ LEVTTTPYTH PILPLLADTN SGQVAVPNMT
LPNNHFQWAE DIPRHLQKSW DLYKDRFGQE PRGLWPSEQS VSPEILPYII KQGFNWICSD
EAVLGWTLKH FFHRDGAGNV QQPELLYRPY LLQTPAGDLS IVFRDHRLSD LIGFTYSSMQ
LKQAVADLVG HLQVIAKMQR EKPSEQPWLV TIALDGENCW EFYPQDGKPF LETLYQSLSN
EPHIKLVTVS EFLDKYPATA TIPGEQLHSG SWVDGSFTTW IGDPAKNRAW DYLTQARQVL
ANHPEATEDN NPAAWEALYA AEGSDWFWWF GEGHSSNQDA MFDQLFREHL YGIYKALNEP
IPVYLTKPVE VHETRADRRP EAFIHPVIDG KGDEQDWDKA GRIEVGGARG TMHNSSLIQR
LWYGVDHLNF YLRVDFKSGI APGKELPTEL NLLWYYPDRT MVNSPVTLAE VPDISPVNYL
FHHHLEINLL TQSIQFREAG NNYQWYPRVS RAQAALNTCL EVAIPWADLQ VPPDYPLRLI
LVLADDGCFH SYLPENALIP IEVP