Gene Aazo_4053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4053 
Symbol 
ID9341858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4112734 
End bp4114491 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content43% 
IMG OID 
Productcell wall hydrolase/autolysin 
Protein accessionYP_003722635 
Protein GI298492458 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.630004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAAG TCTTAGGATT AGTAGTGTTT AACTGTCTGT TTACCTCCTC CGTTGCTTTA 
GCACAAACAT CACTCCTAGT AGTTTTTCCC CCAACAAACT ACCAAACCAG TGCAGAAAAA
ATATTTTTTA TCGGTACAGC GCCAACGGAT GGACAAGTCC TTATCAATAG TAGACCAATT
ACCTGCAGTA AAGCTGGTCA TTTTTCTCCC AGTTTCCCCT TACAGTTAGG GGAGAATGTA
TTTAAGGTAC GTTACCAGAA TCAGGAACTA GAGATTAAGG TAACAAGGTT GTCTACTCAA
TCAGAATTAC CCCAAGGTTT AGGCTTTGCT AAAGATTCTC TGACTCCTGC TGCTGATATT
GCTAGACTGC CGGGAGAATT GATTTGTTTT GGTGCTGTTC CACCTCCTCA AGCTACTGTC
TCTGTCAAGC TGGCTAATCA AACCATTCCC TTGTCACCAC AACCACCACA GGCGCAATTA
CCTGCTAATT CAAGTGTACT AACGGGAATA AATAAGCCTA GTACTAGTAC TACTAGTCCT
AAAAAATATC AAGGCTGTAC AACAGTGGCC AATGTTGCTG ATTTGGGACA ACCTCAATTT
AGTTTGACAT TGAATGGTCA GACCATCACT CAAACTGGTA AGGGCAGAAT TCAAATTCTT
GATGCTGCAC AGTTAACAGT TGTTGAAGTA ACAGCAACTT CAGGGGTGAC TCGTACAGGA
GCAAGCACAG ATTATTCTCG ACTCACGCCA CTACCAAAAG GTACAAGGGC AACAGTGACA
GGTAAAGAGG GTGATTGGTT ACGCTTAGAC TATGGGGCTT GGATCAATAG CAAAGAAACC
AAAATTATAC CAGATGCACT ACCACCACAG ACGGTAATTA GTAGTGTCGG ATATCGTCAG
CTTCCAGGTG CGACAGAGAT GATTTTTCCA TTACAAATGG CTGTACCTGT GAGTGTGGAA
CAGAGCGATC GCACTTTCAC ACTCACCCTT TACAATACCA CTGCCCAAAC AGACACTATT
CGTTTGGATG ATAACCCCCT AATTTCCCGG CTAGATTGGC AACAGGTCAC TCCACAACAG
GTTAAATACA CCTTTAACCT CAAAAATCTC CAGCAGTGGG GCTATAACCT GAGATACGAC
AATACAACTA TGGTGTTAAC TTTACGTCAT GCACCGCATC TTGAACAAAG AAAACGCCTG
CCTCTATCTG GCATCAAGAT TGTACTTGAT CCGGGACACG GTGGTAAAGA ATCTGGTGCA
AGTGGTCCAA CGGGGTATTT AGAAAAAGAT GTAAATTTGA TAGTTTCTAA GTTACTGCGA
GATGAGTTAG TGCAGCGTGG TGCAACAGTA ATGATGACAA GGGAAGATGA TCAGGATGTT
TCTTTAGTAG AACGTCAGGA GATAATTAGT AAAGAAGAAC CTGCGATCGC ACTTTCTATC
CATTACAATT CTTTACCTGA TGATGGAGAT GCCGAAAAAA CCAAAGGCTT CGGGGCTTTT
TGGTATCATC CCCAATCACA TAGCCCAGCA GTGTTTTTAC ATAATTACGT AGTCAAAAAA
CTCCGAAAAC CTTCCTATGG CGTCTTTTGG AAGAATTTAG CCCTGACTCG TCCCTCTATT
GCACCTTCCG TACTATTGGA ATTGGGTTTT ATGAGTAATC CCTATGAGTT TGAAGAAGTA
GTGAACCCAG AGGAACAGAA GAAAATGGCC AAGACTCTGG CTGATGGGGT GACAGAGTGG
TTTAAAGCGG TCAAGTAA
 
Protein sequence
MKKVLGLVVF NCLFTSSVAL AQTSLLVVFP PTNYQTSAEK IFFIGTAPTD GQVLINSRPI 
TCSKAGHFSP SFPLQLGENV FKVRYQNQEL EIKVTRLSTQ SELPQGLGFA KDSLTPAADI
ARLPGELICF GAVPPPQATV SVKLANQTIP LSPQPPQAQL PANSSVLTGI NKPSTSTTSP
KKYQGCTTVA NVADLGQPQF SLTLNGQTIT QTGKGRIQIL DAAQLTVVEV TATSGVTRTG
ASTDYSRLTP LPKGTRATVT GKEGDWLRLD YGAWINSKET KIIPDALPPQ TVISSVGYRQ
LPGATEMIFP LQMAVPVSVE QSDRTFTLTL YNTTAQTDTI RLDDNPLISR LDWQQVTPQQ
VKYTFNLKNL QQWGYNLRYD NTTMVLTLRH APHLEQRKRL PLSGIKIVLD PGHGGKESGA
SGPTGYLEKD VNLIVSKLLR DELVQRGATV MMTREDDQDV SLVERQEIIS KEEPAIALSI
HYNSLPDDGD AEKTKGFGAF WYHPQSHSPA VFLHNYVVKK LRKPSYGVFW KNLALTRPSI
APSVLLELGF MSNPYEFEEV VNPEEQKKMA KTLADGVTEW FKAVK