Gene Aazo_1770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1770 
Symbol 
ID9339563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1833472 
End bp1835142 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content41% 
IMG OID 
Producttransglutaminase domain-containing protein 
Protein accessionYP_003721019 
Protein GI298490842 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCTG CACTCCCTAG TTTGACTGTT AGCCAGATGT TTGGGCAAAA AACAATCCGA 
CCGCTTACTG CTGCTACGAT TTATGGTATT GCTTTCATCA AAGATAGACT TATTGCTATC
GACACGATTA AAGGTCATCT GTTGGAGATT GACCCTATCT CTGATAACAG TAAAATTCTC
AATCCTCACC AAGTCAAGGA ATTTCAGGAC GTTACTGGTT TAGCAGTGTG GGAAGATACA
CTGTGGATCA GTCGTCGTAA TGGCATTTAT TTGTGTAACT TTACCTCCTT GGGTCTGGAG
CACTTTGTTA CTTTGCCTTA TGCTGCTGAT GGTGTTGGTG TTTGGGAATC GACGATTTAT
GTTAGCTGCC AAAGACTAGC CTCTATTATC GTTTTCGACC GCGATACACG CAAAGAAATT
ACTAGATTTC ATACTCCTGG TGTTGGTATA GAAAATTTAG CTGTTAGTCA GGAAGCTTTG
TGGGTTTGCG ATCACACTGA ACAAACAGTT TATTCTATGG ATCGGGCAAC TGGAGAATTT
CGCTTTAGTG TTATTACACC TTTTGCTTCT CCTACAGGTA TCGCTATTCA TAGCCAAGAC
GCAACAGGCA AGGATATTCT TTATGTTTCC TACTCCACAG AAGAGCCCTA TATCCGTGAT
AACCCCAATG CTGACCCTAG TCATGAGCTA ACATACCGAG ATATCACTTT TATTCATCCC
CTGTATTATC ATTACGAGCC AGATAAACGC TACGCGCTAT CTAATGGTTA TCTGGTAGAA
ATGTCCTATG TAGAGGAAAT TTCTCCCTTG GAGGAGGTTT ATTTACCTAA TGTAGAATGG
CGTATCGCGC TACCATCAGA AACAGAACGC CAAAAGGTCA AACAAGTTGA ACCGATTGGT
TTACCCTTCA CAGAAGAAAT TATTGATGGA CAACGGGTAG CCGTATTTAA GTTTGATTCT
CTCGCTCCCG GAGAACGTCA TATATTTGGC TGGAAAGCAC TTTTGGAAGT CCGGGGTATT
AAGTATCGCA TCACTCCTAA AGATGTAGAA ATTCTGCCTG AACTGACACC AGAACTACAA
ACGCGCTATT TAGTAGATGA TGATGATTTA GCAATGAATA CTGCTATTGT CAATCGTGCA
GCCCGTGAAG CGGTCGGTTC GGAAACAAAT ATATTGCGAA AAATGTACAG TATCCGCAAC
TATGTTTATG ATGAGTTGTC TTATGGGATT AAACCCTATA TAGATACGCC AGATATCGTT
TTAGAACGGG GAGTCGGTTC ATGTGGCGAA TATGTAGGAG TATTATTGGC TTTGTGCCGT
TTGAATGGCA TTCCTTGTCG GACTGTAGGT AGGTATAAAT GCCCCCCCTA TGGTGAACAA
CAGGGAGTTC CGCTGCAACC CGATTTTAAT CATGTTTGGC TAGAGTTCTA TATCCCCAGT
TTTGGTTGGG TGCCAATGGA ATCAAATCCT GATGATATAG GTGATACTGG TCACTATCCC
ACACGCTTTT TTATGGGATT ATGTTGGTAT CACGTTGAAA TTGGAAGAGG AGTCACTTTT
GAAACTTTAA CCAGTAATGG TGATCGGTTA ACAAAAGAAG ACATATCCAT TGGCGACTTG
GCCATTAATC ATATTCGGTT TACAATTCTT AAAGAATTAC CACCTTTTTA A
 
Protein sequence
MNSALPSLTV SQMFGQKTIR PLTAATIYGI AFIKDRLIAI DTIKGHLLEI DPISDNSKIL 
NPHQVKEFQD VTGLAVWEDT LWISRRNGIY LCNFTSLGLE HFVTLPYAAD GVGVWESTIY
VSCQRLASII VFDRDTRKEI TRFHTPGVGI ENLAVSQEAL WVCDHTEQTV YSMDRATGEF
RFSVITPFAS PTGIAIHSQD ATGKDILYVS YSTEEPYIRD NPNADPSHEL TYRDITFIHP
LYYHYEPDKR YALSNGYLVE MSYVEEISPL EEVYLPNVEW RIALPSETER QKVKQVEPIG
LPFTEEIIDG QRVAVFKFDS LAPGERHIFG WKALLEVRGI KYRITPKDVE ILPELTPELQ
TRYLVDDDDL AMNTAIVNRA AREAVGSETN ILRKMYSIRN YVYDELSYGI KPYIDTPDIV
LERGVGSCGE YVGVLLALCR LNGIPCRTVG RYKCPPYGEQ QGVPLQPDFN HVWLEFYIPS
FGWVPMESNP DDIGDTGHYP TRFFMGLCWY HVEIGRGVTF ETLTSNGDRL TKEDISIGDL
AINHIRFTIL KELPPF