Gene Aazo_1983 details


Gene Information

Locus tag: Aazo_1983
Symbol:
ID: 9339776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2062534
End bp: 2064123
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 44%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003721182
Protein GI: 298491005
COG category:
COG ID:
TIGRFAM ID:
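
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record. The function name `check_cds_lengths` is hypothetical and used only for illustration.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinates and lengths of a CDS record agree.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the gene length counts the terminal stop codon.
    """
    span = end_bp - start_bp + 1          # inclusive coordinate span
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                     # a CDS should be whole codons
        return False
    codons = span // 3
    # One codon is the stop codon, which does not appear in the protein.
    return codons - 1 == protein_length_aa


# Values copied from the record above.
print(check_cds_lengths(2062534, 2064123, 1590, 529))  # True
```

Under these assumptions, 2064123 - 2062534 + 1 = 1590 bp, which is 530 codons, or 529 amino acids plus one stop codon.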


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTATAG GCTACGTTGC GCTTGTACTC CATGCCCACT TGCCCTTCGT TCGTCACCCA 
GAAAGTGACT ACGTGCTGGA GGAGGAATGG CTATATGAAG CCATTACCGA AACTTACATC
CCCTTATTGA AAGTATTTGA AGGCTTAAAG CGAGATGGCA TCGACTTTAA AATCACGATG
AGTATGACAC CGCCCCTAGT GTCGATGCTG CGTGATCCTT TACTGCAAGA ATGCTATGAT
GCACATCTAG CTCAACTAGA AGAACTAATA GAACTAGAAG CGGAGCGTAA TATTCATAAT
GGGCATTTGC GCTATTTAGC TGAACATTAT GCCACTGAAT TTAATGAAGC CCGTAAGGTT
TGGGAAAAAT ACGACGGTGA TTTAGTTACA GCTTTTAAAC AGTACCAAGA CTCTAATCAC
TTAGAAATTA TCACCTGCGG TGCTACTCAT GGGTACTTGC CGCTGATGAA AATGTATCCA
CAAGCTGTTT GGGCACAGAT TCAGGTGGCT TGTGAACATT ACGAGCAAAC TTTTGGACAA
GCACCCAGAG GCATTTGGTT GCCAGAATGC GCCTACTATG AAGGCGTAGA GCGAATGTTA
GCAGATGCCG GATTACGCTA TTTCCTCACT GATGGACATG GCATATTATA CGCCCGTCCC
CGTCCCCGCT ATGGTACTTA TGCCCCGATT TTCACAGAAA CTGGTGTAGC TGCTTTTGGT
CGGGATCATG AATCTTCTCA ACAGGTATGG TCTTCGGAAG TAGGCTATCC TGGTGCAGCA
GAATACCGGG AATTTTACAA AGATTTGGGC TGGGAAGCTG AATATGAATA TATTAAGCCC
TACATTATGC CCAATGGTCG ACGTAAGAAC ACGGGAATTA AGTATCATAA AATTACCGGA
CGTGGTTTAG GTCTTTCTGA TAAGGCGCTT TATGATCCTT ACTGGGCAAA GGAAAAAGCG
GCAGAACACG CTGCTAACTT CATGTATAAC CGTGAACGCC AGTCTGAACA TTTACATGGA
ATTATGGGTC GTCCCCCGGT AATTGTTTCT CCCTATGATG CAGAGTTATT TGGTCACTGG
TGGTATGAGG GTCCATGGTT CATTGATTAT TTGTTCCGTA AGTCATGGTA TGACCAAGGA
ACTTATGAAA TGACCCACTT AGCTGATTAT TTGCGCGCTA ACCCGAGACA ACAAGTCTGT
CGTCCTTCCC AGTCCAGTTG GGGTTATAAG GGGTTCCATG AATATTGGTT GAACAATACA
AATGTCTGGA TTTATCCGCA TTTACATAAA GCTGCGGAAC GGATGATTGA AATTTCCAAA
TTGGAACCAG AGGATGAGTT GCAATGGAAA GCTTTGAATC AAGCAGCGCG GGAGTTATTA
TTAGCACAAT CTTCTGACTG GGCGTTTATT ATGCGGACAG GAACAATGGT TCCCTATGCG
GTGAGAAGAA CGCGATCACA CTTGATGCGG TTTAATAAAC TGTATGAAGA TGTGAAAATT
GGCAAAGTTG ATAGTGGTTG GTTGGAAAAA GTAGAATTAA TGGATAATAT TTTCCCCAAC
ATTAACTATC GAGTTTACCG TCCTATGTAG
 
Protein sequence
MAIGYVALVL HAHLPFVRHP ESDYVLEEEW LYEAITETYI PLLKVFEGLK RDGIDFKITM 
SMTPPLVSML RDPLLQECYD AHLAQLEELI ELEAERNIHN GHLRYLAEHY ATEFNEARKV
WEKYDGDLVT AFKQYQDSNH LEIITCGATH GYLPLMKMYP QAVWAQIQVA CEHYEQTFGQ
APRGIWLPEC AYYEGVERML ADAGLRYFLT DGHGILYARP RPRYGTYAPI FTETGVAAFG
RDHESSQQVW SSEVGYPGAA EYREFYKDLG WEAEYEYIKP YIMPNGRRKN TGIKYHKITG
RGLGLSDKAL YDPYWAKEKA AEHAANFMYN RERQSEHLHG IMGRPPVIVS PYDAELFGHW
WYEGPWFIDY LFRKSWYDQG TYEMTHLADY LRANPRQQVC RPSQSSWGYK GFHEYWLNNT
NVWIYPHLHK AAERMIEISK LEPEDELQWK ALNQAARELL LAQSSDWAFI MRTGTMVPYA
VRRTRSHLMR FNKLYEDVKI GKVDSGWLEK VELMDNIFPN INYRVYRPM
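
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned, its GC fraction computed, and its translation compared with the protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons); the sketch ignores alternative start codons and simply stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acid assignments in NCBI order (first base T,C,A,G; then second; then third).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last 30 bases of the gene sequence, copied from the record above.
head = clean("ATGGCTATAG GCTACGTTGC GCTTGTACTC")
tail = clean("ATTAACTATC GAGTTTACCG TCCTATGTAG")
print(translate(head))  # MAIGYVALVL
print(translate(tail))  # INYRVYRPM
```

The two translated fragments match the first ten and last nine residues of the protein sequence listed above. Pasting the full gene sequence into `clean` and passing the result to `gc_fraction` and `translate` would extend the same check to the whole record.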