Gene Aazo_4712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4712 
Symbol 
ID9342519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4816804 
End bp4818162 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content39% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003723035 
Protein GI298492858 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.184255 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTGGTT TTCCTCAGTT AATTTGGACA GATGTGGGGT TACGATTGTT GTCAGTGCTA 
TTGCTGATTG CAATTAATGC CTTTTTTGTC ACGGCGGAAT TTTCAATGGT AACGGTGCGG
CGCACTCGGA TTCATCAGCT GGTTCAGGCT GGTGATATAC CTGCGATCGC AGTGGAAATG
TTACAACGTA GTATTGACAG GTTGCTATCT ACGGCTCAAT TAGGTATTAC CCTATCTAGT
TTGGCACTAG GTTGGATTGG AGAAAGTACA ATTGTTGTGC TGATGGAAGA ATGGTTAAAA
TCCTGGACTG TACCCATCAG TTTAAGTAAC GTTCTGGCAC ATTCTCTCTC AGTTCCCATC
ACCTTTTTTT TAATTGCTTA TTTACAAATT GTTTTAGGAG AATTGTTTCC TAAATCAGTA
GCTATGTTGT ATTCAGAAAA ACTGGCAAGG TTTTTGGGTC CTTCTGTCAA AGCTATTGTT
CGTTTTTTCA GTCCTGTGAT TTGGATTCTC AACCAATCCA CACGCTACCT ATTAAGATTA
TTTGGGATTG AATACACTGG TCAGAGCTGG CGACCTCCTG TAACTCCGGA AGAATTGCAA
TTAATTATCT CAACAGAACG AGAATCTACC GGTTTAGAGT TATCAGAGCG AGAATTACTC
AATAATGTTT TTGAATTTGG GGATATAACC GCTGAAGATG TCATGATTCC CCGTACTAGC
ATTATCGCTT TACCAGAAGA TGCTAGTTTC CACACCTTAC TACAAGAAAT GATCTTAACA
GGGCATTCCC GTTATCCCAT TATTGGTGAA TCTTTAGACG ATATTTGCGG TATTGTTTAT
TTTCAAGATT TAGCAAGACC TTTAGCTACT GGAAAACTGA ATTTAGAAAC ACAAATTCAA
CCTTGGATGC GTTCTCCTCG CTTTGTTCCA GAACAAACTC TTTTGAGTGA ACTTTTGCCA
ATGATGCAGC AAGAAAAACC AGCTATGGTG ATTGTGGTGA ATGAATTTGG TGGTACTGTG
GGATTAGTTA CAATTCAAGA TGTAATTGCA GAAATTATCG GTAATGCCGG TGAACCAGGA
ATTAGTGATG ACTTACTAAT TCAAATGTTA GATAAGCAAA CATTTTTAGT ACAAGCACAA
GTGAATCTGG AAGAACTCAA TGAGGTCTTA CATCTCAATT TACCTCTGAT ACGAGAATAT
CAAACATTAG GAGGATTTGT ACTCTACCAG TGGCAAAAAA TCCCCGCTAA AGGCGAAATA
TTCCACTATG GTAATCTTGA ATTCACTGTA ATATCAGTTA TCGGACCACG CTTGCACCAA
ATTCAAATCA GAAGGTTACT AGATGAATGT TCAGCTTAA
 
Protein sequence
MSGFPQLIWT DVGLRLLSVL LLIAINAFFV TAEFSMVTVR RTRIHQLVQA GDIPAIAVEM 
LQRSIDRLLS TAQLGITLSS LALGWIGEST IVVLMEEWLK SWTVPISLSN VLAHSLSVPI
TFFLIAYLQI VLGELFPKSV AMLYSEKLAR FLGPSVKAIV RFFSPVIWIL NQSTRYLLRL
FGIEYTGQSW RPPVTPEELQ LIISTEREST GLELSERELL NNVFEFGDIT AEDVMIPRTS
IIALPEDASF HTLLQEMILT GHSRYPIIGE SLDDICGIVY FQDLARPLAT GKLNLETQIQ
PWMRSPRFVP EQTLLSELLP MMQQEKPAMV IVVNEFGGTV GLVTIQDVIA EIIGNAGEPG
ISDDLLIQML DKQTFLVQAQ VNLEELNEVL HLNLPLIREY QTLGGFVLYQ WQKIPAKGEI
FHYGNLEFTV ISVIGPRLHQ IQIRRLLDEC SA