Gene Aazo_4775 details

Gene Information

Locus tag: Aazo_4775
Symbol:
ID: 9342582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4874969
End bp: 4878073
Gene Length: 3105 bp
Protein Length: 1034 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003723076
Protein GI: 298492899
COG category:
COG ID:
TIGRFAM ID:
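
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_cds_lengths` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and one terminal stop codon is excluded from the protein.
    """
    span = end_bp - start_bp + 1           # inclusive span in base pairs
    if span != gene_length_bp or span % 3 != 0:
        return False
    return span // 3 - 1 == protein_length_aa  # codons minus the stop codon


# Values copied from the record above.
consistent = check_cds_lengths(4874969, 4878073, 3105, 1034)
print(consistent)
```

Under these assumptions, 4878073 - 4874969 + 1 gives 3105 bp, and 3105 / 3 - 1 gives 1034 residues, which matches the reported lengths.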


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.527155
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTGTGG ATTATTTACA CCCCCAACAC CTTGAAGAAC TTGTTGAAGA TAGTAGTATA 
GATTCATACT TAGCACGATT AAATTTTAGA TCGCTGCAAG GTGTGAATGC ATATCAGTAT
CTACTCATAT CCGAACAACT CCCACGCACC AATACTGGCA TGATCAAAAG TTCATGGTTG
CAACGTTACA GCCATATTAC AGCAGGTGGT TGGTGGTGTT CTGGAAGAGA CCCCCTGAAT
AATTGGCAGA AAATGGAATG GGGATGTTTT AAACCAACCC AACCGCGACA AAACAAAGAT
GGTAAGTCTA TTAAATATGA ACATCCCCCA AGCACAGCAA CAAGGGTATT TTGTCTGCGT
GTAACTTTGC AAATTTGGCA GCAAGTTTCC CAACGGTATA ATATTCCCAT GCCTGAAAAT
ATCATCATTA CTGAAGATGG TGAAGCAGAA GGTTTTTGGC AATGGATAAT GGAATGTAAT
ATCTCAATAA TTATTTGTGA AGGTGTGAAA AAGGCTGCCG CTTTGTTAAC ACAAGGTTAT
CCAGCTATTG CCATTCCGGG AATTACTAGC GGTTATCGAG TTGTAAAAGA TGAATTTGGT
AAAGTCACCC GTCGTCAACT CATTCCTGAT TTAGAAGTTT TTGCTACAAG ACAACGAAGT
TTTTATATAT GTTTTGATTT TGAAAACCAA GCCAAAAAGA TGGCTGCTGT TAACAATGCA
ATTTCTCAAC TTGGTTGTTT ATTTCAACAA CAAGATTGTC CTGTTAAAGT TGTGGAGTTA
CCGGGAATAG AAAAAGGTGT TGATGAGTTT ATTGTTGCTA AAGGTGCAGC TAATTTTGAA
ATAATTTATC GTCAAAGTGT ATATTTAGAA ATTTACCTTG CTCAAACTAA ACCTCACGGG
GAATTAACAA TTACTCCTGC ACTCACTTTT AACCAGCCTT ATTTAGAAAA AATCCCCTTT
CCTACTTCTG GCTTGGTAGG AGTGAAATCC CCTAAAGGTA CAGGAAAAAC CACTGGACTG
CAAGCAGTTG TCAATCAAGC TAAAAGTCGT AATCAACCAG TTTTATTAAT TACTCATCGC
ATTCTTTTAG GAAGATTTTT ATGTGAGAAG ATTGCCATTC AATGGGGAAT TAGCCATCAA
GCATGGAGTA TTGAAGAAGA CCCGACATTA CCAATTAGTA GTTATCAATT ACCAATTACT
AAATCCTTCG GTTTATGCAT TGATTCTATT TGGAAACTGA ATCCAGAAGA TTGGCATGGA
GGCATAGTTA TATTAGATGA AGTAGAACAA TCTTTATGGC ATTTACTTAA TAGTAATACT
TGTAAACAGA AGCGGATTAA GATTCTTAGG ATTTTTCAGC AATTGATTGC TACTGTTCTC
ACAACTGGGG GTCTAATAAT TGCCCAGGAT GCAGATTTAA CAGATATTTC TTTAGAATAT
TTACAAGGTT TAGCAGAGAC TAAAATTACG CCTTGGGTGG TTCTTAACCA ATGGAAACCA
CAGCAGGGTT GGGATGTAAC TTTCTATGAT TCCCCTAACC CAACACCTTT AATTCATCAG
TTAGAATTAG ATTTAATTGC TGGACGTAAA TGTTATGTAA CTACTGATAG TCGCACTGGA
AGTTATAGTT GTGAAACTAT TGAACATTAC CTCAAAGAAA GATTACTTAA ATTAAGAAAG
GAATTTCCTA ATACTTTGGT TGTTAGTAGC CATACTACTA ATACACCTGG TCATGCTGCG
GTTGATTTCA TCACGGCTAT TAATCAGAAA ATTACAGATT ATAATACTGT ATTTGTTACT
CCTAGTTTGG GGACAGGAAT TAGTATTGAT GTCCAACATT TCGACCGCGT TTATGGAATT
TTTCAAGGAG TAATTACTGA CTCAGAAGCC CGTCAAGCAT TAGCACGAGT TCGGGATGAT
ATACCCCGTG TTGTTTGGTG TGCCAAACGT GGTATTGGTT TAATTGGTAG TGGTAGTACA
AATTATCAGT TGTTATCTCA TTGGTATCAA GAAAATCAAA AAGAAAATTT AGCTTTGCTG
AGTCCTTTAC ACAAAATAGA TGTAGATCTA CCTATGGTAT ATGACCCTGT GCATTTACGA
ACTTGGGCTA AATTATCAGC TAGGGTAAAT GCTTCTATTC GTCTCTATCG GCAATCATTG
CAAGATGGTT TAATTGCTGA TGGACATCAA GTTATGATGC GGAGTAATAC AGTCCAAAAT
AATATTATTC GGGATTTACG CTTGGCTTTC TTTGCTACTG ATGCTAGTGA TTTAGAAACT
AGAAAGAGGT TAATTGTCGA AATTGTCAAA GTCCAGAAAG ATTGGGTGAA AAGTCGTCAA
AAAGCTAAAG ATATTAAGCG CAAAATCCAG GAAATTAAAC AACACAATCA ATTATTAGCT
GCAAAGGCTG TAGCTAATGC TAGTGATATT GATTATTGTG AATATAATCA ATTATTAAAT
AAACATTCTC TTAGTGATAA GGAACGTAAC CAAATAAATA AATATTTGCT CAGAGATATG
TATGGGATTG AAGTAACTCC TATGCTAACA TTGCGCGATG ATAAGGGTTA TTATGGACAA
TTATTAACTC ACTATTATCT GACCCATGAA AGTGAATATT TCCATGTCAG AGATCAACAA
GAGTGGCATA AACAATTATA TTGGGGTGAA GGAAAGGTCT TTTTACCAGA TTTGAGAACG
TATACTCTCA AAGTTGAAGC TATGCGAGCA TTAGGTATGT TGGAATTCTT GGAAAATGGT
AGAGTATTTA AGGAAAATGA TGCTGATTTG ATTTGGTTGA AGAATGTGGC TGTGCAAAGT
AATAAACATA TTAAACGAGC GTTGGGTATT GATGTGGTAC ATGGGAAGGA AGTAGTTTCT
GGAATTAAAA TCCTGGGCAG ACTCCTAAAT TTACTAGGTT TGAAGTTACA TCACGTAAAT
GATATTTATC AAATTGATTC GCAAACATTA AATGATGGGA GAGGAAATAT ATTTAGTGTT
TGGCAACAAC GTGATGAGTT GCGGTTGTAC CATTTGTATG GTGATAATAC TACAATTTTT
GATGATTCTT TAAATTCCCA GTCAATGGCA CTGCAAGTAA TGTAA
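
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any calculation. The sketch below shows one way to compute a GC fraction comparable to the "GC content" field; the function name `gc_fraction` is hypothetical, and how the source database rounds its reported percentage is not known.

```python
def gc_fraction(seq_text):
    """Return the fraction of G and C bases in a block-formatted sequence.

    Whitespace (spaces and newlines between the 10-base blocks) is ignored.
    """
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence above, used as a small example.
example = "ATGCTTGTGG ATTATTTACA"
print(f"{gc_fraction(example):.2f}")
```

Pasting the full gene sequence into `gc_fraction` would give the value to compare against the reported GC content.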
 
Protein sequence
MLVDYLHPQH LEELVEDSSI DSYLARLNFR SLQGVNAYQY LLISEQLPRT NTGMIKSSWL 
QRYSHITAGG WWCSGRDPLN NWQKMEWGCF KPTQPRQNKD GKSIKYEHPP STATRVFCLR
VTLQIWQQVS QRYNIPMPEN IIITEDGEAE GFWQWIMECN ISIIICEGVK KAAALLTQGY
PAIAIPGITS GYRVVKDEFG KVTRRQLIPD LEVFATRQRS FYICFDFENQ AKKMAAVNNA
ISQLGCLFQQ QDCPVKVVEL PGIEKGVDEF IVAKGAANFE IIYRQSVYLE IYLAQTKPHG
ELTITPALTF NQPYLEKIPF PTSGLVGVKS PKGTGKTTGL QAVVNQAKSR NQPVLLITHR
ILLGRFLCEK IAIQWGISHQ AWSIEEDPTL PISSYQLPIT KSFGLCIDSI WKLNPEDWHG
GIVILDEVEQ SLWHLLNSNT CKQKRIKILR IFQQLIATVL TTGGLIIAQD ADLTDISLEY
LQGLAETKIT PWVVLNQWKP QQGWDVTFYD SPNPTPLIHQ LELDLIAGRK CYVTTDSRTG
SYSCETIEHY LKERLLKLRK EFPNTLVVSS HTTNTPGHAA VDFITAINQK ITDYNTVFVT
PSLGTGISID VQHFDRVYGI FQGVITDSEA RQALARVRDD IPRVVWCAKR GIGLIGSGST
NYQLLSHWYQ ENQKENLALL SPLHKIDVDL PMVYDPVHLR TWAKLSARVN ASIRLYRQSL
QDGLIADGHQ VMMRSNTVQN NIIRDLRLAF FATDASDLET RKRLIVEIVK VQKDWVKSRQ
KAKDIKRKIQ EIKQHNQLLA AKAVANASDI DYCEYNQLLN KHSLSDKERN QINKYLLRDM
YGIEVTPMLT LRDDKGYYGQ LLTHYYLTHE SEYFHVRDQQ EWHKQLYWGE GKVFLPDLRT
YTLKVEAMRA LGMLEFLENG RVFKENDADL IWLKNVAVQS NKHIKRALGI DVVHGKEVVS
GIKILGRLLN LLGLKLHHVN DIYQIDSQTL NDGRGNIFSV WQQRDELRLY HLYGDNTTIF
DDSLNSQSMA LQVM
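
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline: the names `CODON_TABLE` and `translate_cds` are hypothetical, and the sketch relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. It does not handle the alternative start codons that table 11 permits, which is harmless here because this gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq_text):
    """Translate a block-formatted coding sequence into one-letter residues.

    Whitespace is ignored; a single trailing stop codon is dropped.
    """
    seq = "".join(seq_text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks and the last line of the gene sequence above.
print(translate_cds("ATGCTTGTGG ATTATTTACA CCCCCAACAC"))
print(translate_cds("GATGATTCTT TAAATTCCCA GTCAATGGCA CTGCAAGTAA TGTAA"))
```

The two example calls correspond to the first block (MLVDYLHPQH) and the last line (DDSLNSQSMA LQVM) of the protein sequence listed above.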