Gene Aazo_5144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5144 
Symbol 
ID9342952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5271281 
End bp5272495 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content40% 
IMG OID 
Productpeptidase M23 
Protein accessionYP_003723331 
Protein GI298493154 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACGG CATTTTGGGG AAAAATCAGA TTTGGGTGGT GGTTTCCGAT ATTGTGCTGC 
ATTTTGGGTA GTGTTTTGGT ATTGCTACCA GCATACGCAG TGTCTTCACC AACCATAAAC
AATCTCAAAC AACAGCAACA GCAAATTCAG CAACAGCATC AGAATGTGGT TCAAGAACAG
AATCGCTTAA CGAATCTGCA AAAAGAAGCA CAAAAAAATC TTGGTGGTTT AAATCAAAAT
CTCCAAACTA CTAATAGCCA AATTAAAGAT AGTGAAGCCA GGTTAAAATC CGCTACAAAA
CGACTCGAGC AACTCGAAGC TGATTTGGTT CTAGCAGAAC GTAGTTATCA AGAAAGACAA
GCTGCAACCG TGGCGAGATT GCGGTATCTT CAGCGTTCAT CGGCTTCTCA AGGATTAGCT
GTTTTGTTGC AAAGTCATAA TTTGAGTGAT TTTAATAGTC ATCGTCATCA GTTGAAGTTA
GTTTATCAGG CAGACCAACA AATTTTAGCT AAATTAGCAA CCCAAGGAAA TTTCTTAATT
CGGCAAAAAA CGGAAGTAGA AACGCAAAAA AATTTGATGG CTTTAATTAG AGAACAATTA
TTAGTTCAAA AAGCTGATTA TCAAAATCAA GCGCAGTCAC AAGCAGAATT AATTTCTCGT
TTAAATAGCG ATCGCTTGGC CTTGGAAGCA GCACAAAACC AACTAGAGAA AGATTCCCAA
AATTTCACTG TTTTGATTCA ACAAAAAATC GCTGAACAAC AAGCGAAAGA AGCCAAGGAA
GCAGCACAGA CTAAAGCTAA TAGCAAAATT TGGGTTCTTG GTACAGGTAT TTTTGCCTTT
CCTAGTGATG CACCTACCAG TAGTCCTTTT GGTTGGCGGA TACACCCTAT TCTTGGTTAT
CGCCGCTTTC ACGCCGGTTT GGATTTTGCC GCTAGTTATG GTAGTACGAT TAGAGCCGCA
GATTCAGGTA CAGTGATTTT TGCTGGCTGG TATGGTGGTT ATGGCAAAGC TGTGATTATT
AGTCATGGTA AAGGAATTAC CACCCTATAT GGGCATACCA GTGAGTTGTA TGTGACAGAA
GGGCAATCAG TTCAAAAAGG ACTAGCGATC GCAGCTGTAG GTTTCACAGG TTTATCCACA
GGTCCCCACC TCCATTTTGA AGTTAGGCGC AATGGTACAC CTGTTGACCC AGCTAATTAT
TTAGGTTTGC TGTAG
 
Protein sequence
MNTAFWGKIR FGWWFPILCC ILGSVLVLLP AYAVSSPTIN NLKQQQQQIQ QQHQNVVQEQ 
NRLTNLQKEA QKNLGGLNQN LQTTNSQIKD SEARLKSATK RLEQLEADLV LAERSYQERQ
AATVARLRYL QRSSASQGLA VLLQSHNLSD FNSHRHQLKL VYQADQQILA KLATQGNFLI
RQKTEVETQK NLMALIREQL LVQKADYQNQ AQSQAELISR LNSDRLALEA AQNQLEKDSQ
NFTVLIQQKI AEQQAKEAKE AAQTKANSKI WVLGTGIFAF PSDAPTSSPF GWRIHPILGY
RRFHAGLDFA ASYGSTIRAA DSGTVIFAGW YGGYGKAVII SHGKGITTLY GHTSELYVTE
GQSVQKGLAI AAVGFTGLST GPHLHFEVRR NGTPVDPANY LGLL