Gene Aazo_0601 details


Gene Information

Locus tag: Aazo_0601
Symbol:
ID: 9338387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 631731
End bp: 633371
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003720212
Protein GI: 298490035
COG category:
COG ID:
TIGRFAM ID:
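
The start, end, gene length, and protein length fields in this record are consistent with one another. A minimal sketch of the arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the function names are illustrative, not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(631731, 633371)
print(length)                  # 1641
print(protein_length(length))  # 546
```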


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATATA GCGGTTTTCA CATCTCTACT TTATCTTTAG GCGCTAGTCT AACCGCTTTG 
TTGGTAATTG CTAGTAGTTC CATGCTATTA ACCAGTCAAG TAAATGCTGG TTCTCCTCCT
ACTGTGATTG CACAAATTCC AGCAAGTGCA ACAGTGATTT ATGTTAATCC TGTGATCGGT
CAAGATAGTT CTAGTGCTGG TATTACCCCA GAAGCACCCT ATAAAACTAT TACCTTCGCT
CTTTCGCAGG CTAAATCAAA TACAGTGATT AAACTCGCTC CTGGTACCTA TACTAAGGAT
ACTGGGGAAA CTTTTCCCTT ACTGCTTAAA CTAGGAGTGA TACTTGTCGG TAATGAATCT
ATCAAAGGTC AAGGAACAGT CATCATCGGT GGTGGTCATT ATATCAGTCG TACCTTTGCT
AGACAAGATA TTACCATCCT AGCGGAAAAT ACTACTATTG CTGGTATTAC TGTTACCAAC
CCTAATCAAC GAGGTACGGC TGTTTGGGTA GAGTCAAGTA GTCCAACTAT CAAAAACAAT
ACTTTTACTG ACAGCATCAG AGACGGTGTT TTCGTTACAG GTACAGGTAA TCCCAAAATT
GAAAACAACC TTTTTATCAA AAACCGGGGT AATGGGATTT CAATAACTAA ATATGCTCAA
GGTGAGATAC GCAACAACTC ATTTGAAGAT ACTGGTTTTG GTTTGGCTAT TGGTGGTAGT
TCGACACCCT TGGTAGAAGG AAACCAAATT CTTCAAAACC AAGACGGTAT ATTTATCTCC
GAATCTGCTA AACCTATTTT GCGTAAGAAT GTCATTCAGA ATAATAGGCG CGATGGTATT
GTCGCAACTA TTGACGCTCT ACCCAATCTT GGTACTAATG ACAATCCTGG TAGTAATCTC
ATCCGTAATA ACACTCGTTA TGACTTGAAT AATTCTACTA AGGTTAACAG GATTGTTGCT
ATTGGCAACG ATTTTGATCA AAAGCGGATT TTTGGCGCAG TAGATTTTGT GGCTGCAACT
GTTAACCCTC CTACAGGTGG AGGTACTACA GGTTCTACCG GTTTTCAGGA TGTACCAACA
GGTTATTGGG CAAAAGCCTA CATTGAAGCT TTGGCTTCCC AAAATATTAT TGCGGGTTTT
CCTGATGGTA CTTTTAAGCC TAATGATCCT GTAACTCGCG CTCAATTTGC TACCATTATA
ACCAAAGCTT TGGCACCACC GTCTAGACGG ACAGCAATTC GATTTAACGA TGTAAATAGC
AATTTTTGGG CTTATGGAGC AATTCAATCA GCTTACCAAA GTCAATTTGT GGCTGGGTAT
CCTGATGGTA CTTTTAAACC ACAGCAACAA ATTCCTAGAG TTCAGGCTTT AGTTGCTCTA
GCTAATGGTT TAAACCTTAC TGCCAACAAT GAAAGTATTC TTAGTTTTTA CACAGATGCT
GCTCAAATCC CTAATTATGC AATGGGATCT GTTGCTGCTG CAACAGTCAG GCAATTAGTG
GTTAACTATC CCACTGTAAA ATTACTTGAT CCCAATCGTG AAGCTACTAG AGCAGAAATT
GCAGCTTTTG TTTATCAAGC ACTTGTCACT ATTGGACGGG CGCAACCAAC ACCTTCTCCT
TATGTGGTAA CGGCTCAGTA G
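
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below strips the whitespace and computes the GC fraction as (G + C) / total length. Whether the database derives its GC content field in exactly this way is an assumption, and `clean_sequence` and `gc_content` are hypothetical helper names.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Works on any pasted fragment; here, the first two blocks of the gene.
fragment = "ATGAAATATA GCGGTTTTCA"
print(f"{gc_content(fragment):.0%}")
```

Pasting the full gene sequence in place of `fragment` gives a value that can be compared against the GC content field.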
 
Protein sequence
MKYSGFHIST LSLGASLTAL LVIASSSMLL TSQVNAGSPP TVIAQIPASA TVIYVNPVIG 
QDSSSAGITP EAPYKTITFA LSQAKSNTVI KLAPGTYTKD TGETFPLLLK LGVILVGNES
IKGQGTVIIG GGHYISRTFA RQDITILAEN TTIAGITVTN PNQRGTAVWV ESSSPTIKNN
TFTDSIRDGV FVTGTGNPKI ENNLFIKNRG NGISITKYAQ GEIRNNSFED TGFGLAIGGS
STPLVEGNQI LQNQDGIFIS ESAKPILRKN VIQNNRRDGI VATIDALPNL GTNDNPGSNL
IRNNTRYDLN NSTKVNRIVA IGNDFDQKRI FGAVDFVAAT VNPPTGGGTT GSTGFQDVPT
GYWAKAYIEA LASQNIIAGF PDGTFKPNDP VTRAQFATII TKALAPPSRR TAIRFNDVNS
NFWAYGAIQS AYQSQFVAGY PDGTFKPQQQ IPRVQALVAL ANGLNLTANN ESILSFYTDA
AQIPNYAMGS VAAATVRQLV VNYPTVKLLD PNREATRAEI AAFVYQALVT IGRAQPTPSP
YVVTAQ
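
The protein sequence can be checked against the gene sequence by translating codons. The following is a minimal sketch, not the database's own method: it builds the codon table from the standard NCBI base ordering (T, C, A, G), whose amino acid assignments are the same for translation table 11 and the standard table, and translates the first thirty bases of the gene. The `translate` helper is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGAAATATA GCGGTTTTCA CATCTCTACT"))  # MKYSGFHIST
```

The result matches the first block of the protein sequence.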