Gene Aazo_3976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3976 
Symbol 
ID9341780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4041628 
End bp4043433 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content37% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003722590 
Protein GI298492413 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.671278 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGTT TGTTTGGTAA TTGGGCCAAC ACCCTGAGAA AAAATTCTCT ATTGCTGGTA 
CTTTTAATGC TGCTGCCAAC ATTGGGAATT AGTAATTCTG TCATGGCAGC CGAGAAAATC
TATGCTTCTT ATTCTATACT AGAAATTCCC ATTCCCGTGA TTAGTTTGGA AACTTATGTA
AAAACGGGAG TAATTGATGA TGAGTTAGCA ATTTCTAAAC AGTATATTTC ATTTAAAAAA
CTGCAAGAAT TACGACAAAT TTTACTAAGG TCTGTAAAAA TTAGTCCGGC TATTGCTGCA
CGATTTCTCC ACACCCAACA AGGAGAATTC CTACTGCGAC GTCTAGCAGA AGTAATAAAA
ACTAAATCTT TGGTACCTGA ATCAGAATTC AATGCTTTAC GTGATGCGAT CATTGCTGCT
TCTGGTGAAC CGGAAGGCTT GAGTTTATTG AATTTGTTAC GGAACTATCC CAGCAGCACT
GTTCATCTTA AGTTAGGTGA TAGTTTAGAA ATAGCTGGTA AACTAGAACA ACTAATTAGT
GAAACTGATA AAGCAATCGC AGCAATTCAA CAATTATCAA ATATCGAAGC TGGTAAGATC
CGAAATGTAA ATCTATCGCC ATTGCCAGAT TTACAAACTG AAGGAAATTT TCAATCAAAT
AAGTACACGC TGGAATTTTT TGACTCTACC CGTAATCGCC GTTTGTTTAC AGATGTTTAT
ATTCCTAACG TTCCCCACCC CACACCAGTA ATTGTGATTT CTCATGGCTT AGGTTTAGAC
AGCAGTAACT TTCGTTATTT AGCTCATCAT TTGGCTTCCC GCGGATTAGC TGTTGTCGTT
CCTAATCATC CAGTTAGTCA GGATCAACAG CCACAAAAGC AATTTTTCAT CAAGAGAAAT
ACGCGTAAAG TTATAGAGGC TGGTGAATTT TTAGATCAAC CTTTAGATAT AAAATACATA
TTAGATCAAC TCCAAAACTC TAACCAATAT GATCCAATTT TTAAAGGTAA ATTAAACTTG
GAACAGGTGG GAGTATTTGG TCAATCTTTT GGTGGTTACA CTGCTCTAGT TTTAGCAGGT
GCAAAGATTA ATTTCGAACA GCTAGAACAA GATTGTCAAC CAGATGTGCT GCGAGATACC
TGGAATATGT CTTTACTGCT GCAATGTCGC GCTTTAGAAT TACAAACGAA GTCCAGACAA
AAATATAATT TTAATTTACG GGATGAAAGA GTTAAAGCTG CAATTGCTGT TAATCCCATT
ACTAGCTCTA TTTTCGGTCA AGTTGGCTTG AATCAAATTC AAACTCCTGT CATGTTTGTT
AGTAGCAGTG AAGATACTGT TGCACCAGCT TTATATGAAC AAATTTTACC GTTTTCCTGG
TTAACGCATC CCCATAAATA TCTGGTTATG CTTGTGGGTG GAACTCATTT TTCTAGTATT
GGTAATAGTA ATACTGGTAG TCAGCAAGTC CGATTACCTA CAGATATGAT TGGTAATGCT
TCCCAAGCGC GTAGTTATAT AAATGCGTTT AGCTTGTCGT TTTTCCAAAC GTATGTTTCT
CAAAAGCCAC AATATATTCC CTACCTGAAT GCGGGTTACG CTAAAACTAT TTCTAGTCAG
TTTTTGGGGT TGAGTCTTGT GCAGTCTTTG AATAGTCAAG AATTCGCCTC AGTATTAGGT
AGTAATATTC AGGAAGTGAT CCCCGGGAAA AAAACTTCCT ACAACCATAA TCAGGTTTGG
ATTTTGGATT TGAGATATTG GTGTATCCTT GCTTCATGTC ATGATTTTTT TGTGATGTAT
TTTTAA
 
Protein sequence
MNSLFGNWAN TLRKNSLLLV LLMLLPTLGI SNSVMAAEKI YASYSILEIP IPVISLETYV 
KTGVIDDELA ISKQYISFKK LQELRQILLR SVKISPAIAA RFLHTQQGEF LLRRLAEVIK
TKSLVPESEF NALRDAIIAA SGEPEGLSLL NLLRNYPSST VHLKLGDSLE IAGKLEQLIS
ETDKAIAAIQ QLSNIEAGKI RNVNLSPLPD LQTEGNFQSN KYTLEFFDST RNRRLFTDVY
IPNVPHPTPV IVISHGLGLD SSNFRYLAHH LASRGLAVVV PNHPVSQDQQ PQKQFFIKRN
TRKVIEAGEF LDQPLDIKYI LDQLQNSNQY DPIFKGKLNL EQVGVFGQSF GGYTALVLAG
AKINFEQLEQ DCQPDVLRDT WNMSLLLQCR ALELQTKSRQ KYNFNLRDER VKAAIAVNPI
TSSIFGQVGL NQIQTPVMFV SSSEDTVAPA LYEQILPFSW LTHPHKYLVM LVGGTHFSSI
GNSNTGSQQV RLPTDMIGNA SQARSYINAF SLSFFQTYVS QKPQYIPYLN AGYAKTISSQ
FLGLSLVQSL NSQEFASVLG SNIQEVIPGK KTSYNHNQVW ILDLRYWCIL ASCHDFFVMY
F