Gene Aazo_3687 details

Gene Information

Locus tag: Aazo_3687
Symbol:
ID: 9341492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 3749937
End bp: 3751289
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: cell wall hydrolase/autolysin
Protein accession: YP_003722365
Protein GI: 298492188
COG category:
COG ID:
TIGRFAM ID:
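
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the record states neither convention explicitly, so both are assumptions.

```python
# Values copied from the gene record.
start_bp = 3749937
end_bp = 3751289

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1353 0 450
```

Under these assumptions the result agrees with the listed Gene Length (1353 bp) and Protein Length (450 aa).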


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGACGTA TTTTCATTTC GGCGGCGCAT GGGGGCAGGG AAGCGGGTGG AATTGATTCT 
GGTGCGATCG CAGGTGGTAC AACTGAAGCA AAAGAAATGA TTCTGTTACG GGATTTAATT
GTTACAGAAA TTAGAGTGCG TACGGTGGAA GTCTTAGCTG TTCCTGATGA TTTAAGTGCT
GCTCAAACTA TTACCTGGAT TAATTCTCGC GGTCGAACTG GTGATGTCGC CGTAGAAATT
CACACCGATG CTGCTGGTAG TCCTACAGTT AGGGGGGCTG GAGTTTTTTA TATTGCTAAC
AACAATCAAC GCAAGCAAAA TGCAGAAATG GTGCTGATGG GATTGTTACG TCGTGTCCCT
CAATTACCTA ATCGTGGAGT TAAACCCGAT ACAGATAGCG GCTTAGGTAG TTTACAATTC
TGTCGTCAGA CAAATGTCCC TGCTTTATTA ATGCAAGTAG GCTTTATTAG CAGTCCAGAT
GATCACAGTT TGTTACAAAC TCGTCGCCGT GATTTTGCTT TAGGAATAGT GGATGGATTG
GTGGCTTGGA GTAGGGCAGT TGACCCTAAT CCGGGAACTC CCCCACAACG AACTTATCCA
CCCATTAATA TTAACATTAA TGGTCGAAAT TATGCAGAGC AAGGTGTATT GATTAATGGT
AACTCTTATA TCCCTATTGA TTTAGTAGAT CGCTTACGGA TTGATTTATC CAAATCACCT
AATGTTGTTC GTGTTACCTA TCAGCGAGTA GTTTACATTA AAGCCATTGA ACTTCGAGAG
TTTAATGTTT CTATTGAATG GGATAGTGCA ACAAAAACTC TGAACCTACG TTCAATTTTA
TCAGTTTGTC CAGGACAAAT TAACCAAATT GTTTCTAATG GTAATACCAC AGAATTACAG
CTACAATTAT TTTTGCGAAA TAACAATGAA AATGCTCTCG TCCAGTTTCC TGATATTCCC
AAACTCTATC GAGAAGAAGC GGCAATGGAA GGAGTAAATC ATGATATTGC GTTTTGTCAG
ATGTGTTTGG AAACTGTCTT TTTACGATTT GGTAGTGATA TTAAACCTCA ACAAAATAAT
TTTGCTGGTT TAGGTGCAAT AGGTGGTGGA ACACAAGCAG CTTCTTTTTC TAGTGCCAGA
ATTGGAGTGA GAGCGCATAT CCAACATTTA AAAGCCTATG CCAGTTTAGA ACCTTTGGTA
CAACAAGAGG TAGATCCTCG GTTTAGATTT GTAACTAGAG GTGTAGCTCC TTCTGTTGAT
CAGTTATCAG GAAGATGGTC AGCAGACTTA GATTATGGTA CTAAAATTAA AGCCATGTTT
AAAAGACTTT ATGAATCAGC AAAATTGATT TAA
 
Protein sequence
MGRIFISAAH GGREAGGIDS GAIAGGTTEA KEMILLRDLI VTEIRVRTVE VLAVPDDLSA 
AQTITWINSR GRTGDVAVEI HTDAAGSPTV RGAGVFYIAN NNQRKQNAEM VLMGLLRRVP
QLPNRGVKPD TDSGLGSLQF CRQTNVPALL MQVGFISSPD DHSLLQTRRR DFALGIVDGL
VAWSRAVDPN PGTPPQRTYP PINININGRN YAEQGVLING NSYIPIDLVD RLRIDLSKSP
NVVRVTYQRV VYIKAIELRE FNVSIEWDSA TKTLNLRSIL SVCPGQINQI VSNGNTTELQ
LQLFLRNNNE NALVQFPDIP KLYREEAAME GVNHDIAFCQ MCLETVFLRF GSDIKPQQNN
FAGLGAIGGG TQAASFSSAR IGVRAHIQHL KAYASLEPLV QQEVDPRFRF VTRGVAPSVD
QLSGRWSADL DYGTKIKAMF KRLYESAKLI
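
As a rough illustration of how the two sequence listings relate, the following sketch strips the 10-character block spacing, computes a GC fraction, and translates codons up to the first stop. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical and not part of the source record. The amino acid assignments are those of the standard genetic code, which translation table 11 shares; table 11 also permits alternative start codons, which this sketch does not handle (the gene here begins with ATG).

```python
# Codon table built from the conventional TCAG ordering.
# The amino acid assignments are identical in NCBI tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_blocks = clean_sequence("ATGGGACGTA TTTTCATTTC GGCGGCGCAT")
last_blocks = clean_sequence("AAAAGACTTT ATGAATCAGC AAAATTGATT TAA")

print(translate(first_blocks))  # prints: MGRIFISAAH
print(translate(last_blocks))   # prints: KRLYESAKLI
```

The two printed fragments match the first and last ten residues of the protein sequence listed in the record. Passing the full cleaned gene sequence to `gc_content` gives a value that can be compared with the GC content field.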