Gene Aazo_4067 details

Gene Information

Locus tag: Aazo_4067
Symbol:
ID: 9341872
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4127200
End bp: 4129443
Gene Length: 2244 bp
Protein Length: 747 aa
Translation table: 11
GC content: 33%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003722646
Protein GI: 298492469
COG category:
COG ID:
TIGRFAM ID:
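
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon (the gene sequence in this record ends in TAG). The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4127200
end_bp = 4129443
gene_length_bp = 2244
protein_length_aa = 747

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends in exactly
# one stop codon that is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches protein length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: the span is 2244 bp, which is 748 codons, or 747 amino acids plus one stop codon.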


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCCAG ATAATTGGCA AGAACAAGCA CAAAAATATC TTATAGATGA AAATTACAAT 
GCAGGTGTAA AACTTTACGA AGAAGCAATA GAAAGAGAAA CTAATATTAA GAGTTATTAT
TGGTATTTAG GATTACTTTT ATTATTACAG AATCAGGAAA CCGAAGCACA AATTACTTGG
TTTACTGCCT TATCAGAATC AGAAGAAATA GATTTTGATA CGCAAGAACT ATTAGAAATT
TTAGATAAAG AAGCCAAGCG TCGCATCACA ATTGAAGATT ATCATGTTGC TTGGGCAATC
CGTCAGCATA TCAAAGAAAT TGTATCTGAA GATATTAATA ACTTACTGCA TCTTGTTTCC
ATTGTCATTA GTTCAGAAAC ATTCAATCAT GATATTGCAG AGTTATTAGA ATCAGTCCGT
CATTCTTTAT GTATTAGCAC AGATAAATGT GATACCAATT TATTGGTAGA TGTTATAAAA
ACAATCTTAG TTTATACAAG TAAACGGAAA GCAGCAATTA ATTCAGTTTT TGATTTTATT
AAAGTTTGCT TACACAGCTA TTCAGATAGT TCTGTGATGA TTATTATTGA AACTGTAATG
ACCCAATGCA TCAAATTATC AGCTTTTTAT AGAAGACCCG ATTTAGCAGC AAAATTTGCA
GAACTTTGTT TGGGGAGAGT TCCCAAAAAT TACCATCTTC AACATAGAGA GATTTTAACA
CTGATATCAT ATTTTTACCA AGACAACCAA CAATATGATA AAGGTATAGA AAGTGCCAAA
CTATGTTATG AACTAGGAGA TTCTGTAGCA GAAAAAATTT GTGCTAACCA TATTGTTCTC
AGAGGATTCA TGACTTCTGG TAGTTACTGG AAAGAAGCTT ATGATCATTT GCAAGAGCAG
ATTTTATTAA CTGAACAGTT AGTAAAAAAC CAACCTGATG TTGCTCACAT AATAGATATT
ACCAGGTTAT CGAATGCGTT GTCTTTTCTT CCCTATTTTG AGGATAAACC TGAAAGAAAC
CACATACTTC AACATCAGAT ATCTAGTTTT TGCCAATCTC ATTTTATTAA GCAACATACA
GAGATAGTAG AACAAAATTA TAAAAACTTT CATTTACGGC GACAGAATTT CACCCTAAAT
AAAGAAACTT TGAAGGTTGG TTATATTTCC CATTGTTTTA AGCGACATTC TGTTTCTTGG
TTATGCAGAT GGATTTTTAA ATATCATAAC CAAGAAAAAT TTAAAATACA TTGTTACTCT
TATGATGACT CTGAAGGAAT AGACAGTTTT ACCCAGAGGT ATTTTGTTGA TAAATCTTAT
AAATTTTATA AATTTGGACT TCGTCAAACT TATAATCTTT CGAATCAAAT TCAAGAAGAT
GAAATTGATA TCTTAGTTGA TTTAGATAGT TTAACTTTAG ATCAAGTCTG TGAATTATTA
TCAATCAAAT CTGCTCCCAT ACAGGCAACT TGGCTGGGTT GGGATAGTTC CGGTTTACCA
TCAATTGATT ATTTTATTGC TGATCCTTAT GTTTTACCAG AGTCAGCACA GGATTATTAT
AGTGAAACTA TTTGGAGATT ACCACAAACA TATATAGCGG TTGATGGATT TGAGATAGAT
GTACCTGATT TACGACGGGA AGATTTAAAT ATTCCTAGAG ATGCCATTAT TTATTTCATG
ACTCAAAAGG GATATAAAAG GCATCGCCCC CATTTACGTT TACAGATGAA AATTATCAAA
GAGGTACCTA ATAGTTATTT GCTGATTAAA GGGGATGCTG ACCCAGAAGC AAGTAAGGTA
TTTTTTGAAG AGATTGCACA GGAAGAAGGT GTAGATTTTA ATCGGATAAA ATTTTTGCCT
TATGCTCGTA GTGAAGCTGT CCATCGAGCT AATTTGCAAA TTGCTGATGT GGTTTTAGAT
ACTTATCCCT ATAATGGAGC AACGACAACA TTAGAAACCC TTTGGATGGG TGTTCCTTTG
GTAACAAGAG TTGGGGAACA ATTTTCTGCA CGTAATAGCT ATGGGATGAT GATTAATGCT
GCTATTACAG AAGGTATTGC TTGGACTGAA GATGAGTATG TAGAGTGGGG TGTGCGTTTG
GGTAAGGATG AGAAATTACG ACAACAAATT TCTTGGCAGT TGCGTCAGTC GAGACATACA
GCACCGCTAT GGAATGCCAA GCAGTTTACC CGTGATATGG AAACAGCTTA TGAGCAGATG
TGGCAAAGGT ATATTGATAG TTAG
 
Protein sequence
MIPDNWQEQA QKYLIDENYN AGVKLYEEAI ERETNIKSYY WYLGLLLLLQ NQETEAQITW 
FTALSESEEI DFDTQELLEI LDKEAKRRIT IEDYHVAWAI RQHIKEIVSE DINNLLHLVS
IVISSETFNH DIAELLESVR HSLCISTDKC DTNLLVDVIK TILVYTSKRK AAINSVFDFI
KVCLHSYSDS SVMIIIETVM TQCIKLSAFY RRPDLAAKFA ELCLGRVPKN YHLQHREILT
LISYFYQDNQ QYDKGIESAK LCYELGDSVA EKICANHIVL RGFMTSGSYW KEAYDHLQEQ
ILLTEQLVKN QPDVAHIIDI TRLSNALSFL PYFEDKPERN HILQHQISSF CQSHFIKQHT
EIVEQNYKNF HLRRQNFTLN KETLKVGYIS HCFKRHSVSW LCRWIFKYHN QEKFKIHCYS
YDDSEGIDSF TQRYFVDKSY KFYKFGLRQT YNLSNQIQED EIDILVDLDS LTLDQVCELL
SIKSAPIQAT WLGWDSSGLP SIDYFIADPY VLPESAQDYY SETIWRLPQT YIAVDGFEID
VPDLRREDLN IPRDAIIYFM TQKGYKRHRP HLRLQMKIIK EVPNSYLLIK GDADPEASKV
FFEEIAQEEG VDFNRIKFLP YARSEAVHRA NLQIADVVLD TYPYNGATTT LETLWMGVPL
VTRVGEQFSA RNSYGMMINA AITEGIAWTE DEYVEWGVRL GKDEKLRQQI SWQLRQSRHT
APLWNAKQFT RDMETAYEQM WQRYIDS