Gene Aazo_2798 details

Gene Information

Locus tag: Aazo_2798
Symbol:
ID: 9340598
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2881832
End bp: 2883625
Gene Length: 1794 bp
Protein Length: 597 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: WD40 domain-containing protein
Protein accession: YP_003721772
Protein GI: 298491595
COG category:
COG ID:
TIGRFAM ID:
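
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (reasonable for an unspliced CDS, but not stated on this page). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2881832, 2883625)
print(length)                  # 1794
print(protein_length(length))  # 597
```

Both values agree with the "Gene Length" and "Protein Length" fields listed in the record.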


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGATTG ATTGGATTAG TTTACTAAAA GCTCAACAAA CTGACTTCCT CCAAAGAGTT 
AAAAAACCTA AGACTACTGA TTTATCTCTC CTAGAAGGTC AAGTCAAAGG TTGCCACAGT
GAAATTATGG CATTTTGGGG TGAACCATTG GCAAAAATTC TCGAATGTTC TCGTCAACAA
GCGGAAATTT TAGCCAAAAA CCCTCCACCG ATACCACCTG AATATCCAGA CCCTCCAGAT
TGGACAGTTC CCTTTCCTAA ATATTTTCAG AGTCAAGCAG AAGATTATCT GGTGCGTGAA
CAAATTGTAG ATCGGGTAAT TACAGAAAGA TTGGGAAAAC TGGTGAGAAA ATATCATCAT
GACACCTTAG AAAATATGGT GTTAGATGAT GAAGGAAACT TATGTAGTGA AAGTAAGTTT
ACTTATTCGT TGACGAATAA CCCAAAAATT AGTATTCAAG TTCACGTGGC AGATGGAGAA
AGTTTTAACG GTATTAAAAA AGACAAAATC CGGTGGTCAG TCACCCAAGA AGATGTAAAG
AACTATCAAG TCTTAATTTT TCTATGTTTG TTTTATCCAG GAAATACTAA ATTAGGCTAC
GAAAAACAAA CTATTATCAC TGGTTTTTTA CCCACAAATC AACTAGAATT TTCTGAACCT
AAAATCAACT TAACTCCTAG TAGCTTGTTA TATGCAGGAG GATTAACTTG GTATTTAGAA
TCACTAATTG GTAAAAAATA TAACATAACT TTAGTTGATG AATTGATAAT GGTAGAGACA
ATTCAAACCT TACCATCAGA TCATCCCCAA AAGAGTATTA TTGGTGATTG GGAATGCTGG
CAAACATTAA AAGGACACAC CAGAGGTATC AATTGTTTAG CTTATTCTGT GAAAACTCTA
ACAATTAAAG GTTCTGGTTG TAAACACAGT AAAACTGTGC CTATATTAGC TAGTGGTAGT
CGTGGTGAGA CAAAATTATG GGATTTAAGT AAAGGTCAAT TAATTGAGAC TTTATCAGAA
TATCCTTGGG TATTATCAGG ATTAGTAGAT GAAGTTAATT CTTTGGCTTT GAGTGCAGAT
GGACAAACAT TAGTTAGTGT AGGTGCAGAT TCGACAATTA AAATTTGGCA TACTGGTGCA
TTGGATTTAA TTGATATCCT GCACAAGCAT AATGGTAGCG TGCGTTGTGT TGCTTTTACA
CCAGATGGAA AAAAGTTAGC AACTGGTGGA GATGACAGAA AAGTTTTGTT TTGGAATTTG
CGCGATCGCC AAGTTGAAAA TACCTTATGC TTAGATGATA CAGCAGCCCA CTCAATGGTA
GTAAGCCCAG ATGGAAAAAT CTTGATTACA GGCAGTTATC GGAAAATCAA AGTTTGGCAA
TTAACATCTT ATTACAATAA GAAAAACCAG CAAGAAATCA AACCAATACA TACCTTAATG
GGTCATTGTC ATATTGTTAG TTCCCTAGCT ATCAGTGCTA ATAGTGAATT CTTAATTAGT
GGGAGTCAAG ATAAAACAAT TAGAGTTTGG AATTTAGTCA CTGGGCAGTT AATTCACACT
CTCAAAAGTC ATAGAGATGG AGTTTATGCA GTTGTCCTCA GTCCTAATCA ACAAATTATC
GCTAGTGGCA GTGCTGATAA AACTATCAAA TTGTGGCATT TAGAAACGGG AGAACTATTA
GCTACATTTA CAGGCCATGC TAACATAGTG ACAGCTTTGG TATTTACAGC ATCTGGTGAA
ATGTTGGTCA GTGGAAGTTT GGATAAAACC ATTAAAATTT GGCAACGGAG TTAG
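
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and a GC fraction computed; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as displayed in this record.
first_line = "ATGCAGATTG ATTGGATTAG TTTACTAAAA GCTCAACAAA CTGACTTCCT CCAAAGAGTT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC content of this 60-base window only
```

Passing the full 1794 bp sequence to the same functions would give the whole-gene value to compare against the "GC content" field.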
 
Protein sequence
MQIDWISLLK AQQTDFLQRV KKPKTTDLSL LEGQVKGCHS EIMAFWGEPL AKILECSRQQ 
AEILAKNPPP IPPEYPDPPD WTVPFPKYFQ SQAEDYLVRE QIVDRVITER LGKLVRKYHH
DTLENMVLDD EGNLCSESKF TYSLTNNPKI SIQVHVADGE SFNGIKKDKI RWSVTQEDVK
NYQVLIFLCL FYPGNTKLGY EKQTIITGFL PTNQLEFSEP KINLTPSSLL YAGGLTWYLE
SLIGKKYNIT LVDELIMVET IQTLPSDHPQ KSIIGDWECW QTLKGHTRGI NCLAYSVKTL
TIKGSGCKHS KTVPILASGS RGETKLWDLS KGQLIETLSE YPWVLSGLVD EVNSLALSAD
GQTLVSVGAD STIKIWHTGA LDLIDILHKH NGSVRCVAFT PDGKKLATGG DDRKVLFWNL
RDRQVENTLC LDDTAAHSMV VSPDGKILIT GSYRKIKVWQ LTSYYNKKNQ QEIKPIHTLM
GHCHIVSSLA ISANSEFLIS GSQDKTIRVW NLVTGQLIHT LKSHRDGVYA VVLSPNQQII
ASGSADKTIK LWHLETGELL ATFTGHANIV TALVFTASGE MLVSGSLDKT IKIWQRS
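
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below assumes that translation table 11 assigns amino acids to codons exactly as the standard code does (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The `translate` function is an illustrative helper, not part of any database tooling.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCAGATTG ATTGGATTAG TTTACTAAAA"))  # MQIDWISLLK
print(translate("CAACGGAGTTAG"))                      # QRS
```

The two fragments reproduce the first ten and last three residues of the protein sequence shown above.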