Gene Aazo_2133 details


Gene Information

Locus tag: Aazo_2133
Symbol:
ID: 9339928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2212987
End bp: 2215698
Gene Length: 2712 bp
Protein Length: 903 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003721277
Protein GI: 298491100
COG category:
COG ID:
TIGRFAM ID:
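The coordinates and lengths in this table are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2212987
end_bp = 2215698

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed 2712 bp gene length and 903 aa protein length.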


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0029443
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGCAAC AAGAAAAAAA AGATGGTGTA ATCGTTAATC TGGTATTATC GCTGGCCTTA 
GCTACCAGTG GTGTGGTAGC CAACTTATTT GTGTTAGCTC CTACACAGGC AGAATTAAAA
TCTGATTTTA CTACTTTCCC GCTGCCGCAA ACAGTGGAGG ATGGAATCAA AGTGCGAATT
GATGGTTCTG CGAGTTTGGT CATGATTAAC CAAAGCCTAA AAGATACATT TGAGAACCAA
TTTTCTGGTA CACAGATAGA AGTGGGGGTG AATAGTGCTG ATGCTGATGC TGCACTCAAG
ACTTTGCTAG AAGGCAAAAT TGATATAGCT GCCATCGCAC GAGACCTAAC TCCAGCAGAA
AAAGCTCGAG GTTTAGAACA AGTCCATTTG CACCGAGAAA AAATAGCCAT CATCGTTGGT
GCAAATAATC CCTTTCCAGG AAGTTTGACT CCTGAAAAAT TTGCTAAAAT TTTTCGAGGC
CAAATTAAGG ACTGGTCAGA ACTAGGAGTT CAATCTGGTA AGATCCGGCT AATTAATCGA
CCCTCAACAA GCAATACTCA TAATGCCTTT CGTGATTATT CAGTTTTTCA AACTGCTGAG
TTTCCTACAC GAACGAATCC GACTCAAATA GCTGAAGATA AAACTGCCCA AATTATCCAA
CAATTAGGTA CAGATGTCAT TAGCTATGTT ATAGCTAATC AAGTATCAAA GCTACTAGAT
GTGCGAGTTC TGAAAATCAA GGGAGTTACA CCAGGTAATT CTCAATATCC ATTTTCTCAG
CCTTTGGTTT ACGTTTACAA GCAAAATCCC AACCCAGGAG TAGTTGGTTT TCTGAGTCTT
ACCCTTGCAC CTGTAAGAAA AAAGGCGCTA GAACCTCCTA GAGAAGCTGA AGCCTCTGCG
ATCGCAGCCA GTTCTTTACA AAGTGTTAAT CGAGAAACTT TAACAACTTC CTCATCAAAA
CCTCAACCCC TACTAACTGT TGCACCATCT GAAAATTCCA CTATAAGTAC CACTCCACAA
TCACAAACTC CAATCAATAC TCTTGGTTCT GGTAATGAAC AACAGTTTGT GAGTCCTCTG
GAAAATGATC CGCTTGAGGA TAAGAACGTC ATACTCTTAA TAGTTTTATC GCTATTGCCA
ATTTTTGGTT TAGGTGGGTT TCTAACTTGG TGGTTCAAGA GAAAACTGCG ATCAGTAGAT
GAAAAAACAG ATAACTTGGA AACATTAATT TCCAGCACAT CTACCACAGA AACAATATCA
ATCACACCAG ACGATTACAG CATTCTTCCC TATCTTGAAA ATGGTAGCTG CACTAATGGA
ATATCGCATT TAAATCAAAC TACAACTACA ACTTCAATGC TATCCGACAA GGAATATCTT
AACCTTACTC AAGAAGATAA CCTAACAGGT AATTTATTAA CAGCAATTGT CACAGGAACA
AATACCAGAG TAGATCATAC AAATGATTTT GACATTCCTA CTGAAACAAT AGCTGTTGAT
TGTGGTGAAG TAGTATGGGA TACAGAAGCG CCAGTGGCTG TTGTTAATAC ACCTTACCCA
TCAGTACCCA GAATTTCAGG AATTACATTT GATGTAGAAC TGCTAACTTA CGAATTAACC
ACTTCACTAT CAGAATTACT AGATAACCCA GCAGCGCCAT TTCATCAAGA TACCACTACT
CCACTATCAG AAGTAATAGG TTTTCCACCA ATTTCATCCG ATGCTGACTC TAGTACTTCA
CTCTCAGAAT TACTCGGTAT GGCAGCAACC TCTCTTGATA CTGATTCCAG TAATACTCCA
CTAAAATTAC GTCCTGTATC TACAAAAGAG CCTATTACGT CCCTATCAGA ATTATTGGGC
TTACCACCAG AAACATTAGA TTTAGATATA GCACTGAGCA AAGATGAAAC AACAAGTTCA
CTACCTGAAC TATTAGATGA GTTAGGAGAT TTATTCAACA ACTTAGCAGA GGCTGAACTC
AAAATTGATC TGACACCAGA AGAGTTTTCC TCAGACTTGT CTATTTCATC AATGTTCTCA
GAAGAGACTA TTGACTATGC AATCTTGAAA ACAGATGCAA AGATAGAAGT TTCATCAGAG
TTAAAAATAC GAACTAACAT CACAGAATTT GCTAGTTTCT TGGATATAGA CACAGATAGC
AGCATTGTCT TCACACCCCG TACACCTAAG TGGGCTTATG TTTCTTGGTA TGTTTCAGAA
ACTCACAAAG AAGTACTGCG AAAAAAAGGA GGTCGTCTTT TAGCAGTGAG GCTTTATGAT
GCTACTGACA TTGATCTGAG TTATCAAACA CCCCAACTAG TTCAGCAGTA TGAATGTGAG
GAAGCAACTT GCGATCGCTA TATAGATATT CCCACTAGCA ATCGTGATTA CATAACTGAA
ATTGGCTATA CAACAGATAA TAATTGTTGG TTAGGTATAG CTCGTTCAGG TACTATTCGG
ATCTTCAATC CTCCTAGTGA AGATTTCTGG TTTGTCACAG ATACAGAACT AGTTATTCAT
GGATCTACCG AACCAGGAGC AAAAGTGACT ATTGATGATC ATGAAATTGA AATTCAACCT
GATGGAACCT TCAATTTCCG TGTTCCCTTC TCCAATAGTT TACTTCAATA TCTGATGACA
GCAACTGCTG CTAGGGGAGA ACAAACTATC ACCATCCTCA AGAAGTTTTC CCAGGAAAAT
CCAGAAGATT AA
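The gene sequence above is printed in space-separated blocks of 10 nucleotides, so it has to be joined before its length or GC content can be recomputed. The following is a minimal sketch of that step; `parse_blocked` and `gc_fraction` are hypothetical helper names, and the example input is only the first line of the listing, not the full gene.

```python
def parse_blocked(text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence listing, used here as a short excerpt.
first_line = (
    "ATGTGGCAAC AAGAAAAAAA AGATGGTGTA "
    "ATCGTTAATC TGGTATTATC GCTGGCCTTA"
)
excerpt = parse_blocked(first_line)
print(len(excerpt), round(gc_fraction(excerpt), 3))
```

Applying the same two functions to the whole listing should give a length of 2712 and a GC fraction close to the 39% reported in the table, assuming the table value is a rounded percentage of G plus C over all bases.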
 
Protein sequence
MWQQEKKDGV IVNLVLSLAL ATSGVVANLF VLAPTQAELK SDFTTFPLPQ TVEDGIKVRI 
DGSASLVMIN QSLKDTFENQ FSGTQIEVGV NSADADAALK TLLEGKIDIA AIARDLTPAE
KARGLEQVHL HREKIAIIVG ANNPFPGSLT PEKFAKIFRG QIKDWSELGV QSGKIRLINR
PSTSNTHNAF RDYSVFQTAE FPTRTNPTQI AEDKTAQIIQ QLGTDVISYV IANQVSKLLD
VRVLKIKGVT PGNSQYPFSQ PLVYVYKQNP NPGVVGFLSL TLAPVRKKAL EPPREAEASA
IAASSLQSVN RETLTTSSSK PQPLLTVAPS ENSTISTTPQ SQTPINTLGS GNEQQFVSPL
ENDPLEDKNV ILLIVLSLLP IFGLGGFLTW WFKRKLRSVD EKTDNLETLI SSTSTTETIS
ITPDDYSILP YLENGSCTNG ISHLNQTTTT TSMLSDKEYL NLTQEDNLTG NLLTAIVTGT
NTRVDHTNDF DIPTETIAVD CGEVVWDTEA PVAVVNTPYP SVPRISGITF DVELLTYELT
TSLSELLDNP AAPFHQDTTT PLSEVIGFPP ISSDADSSTS LSELLGMAAT SLDTDSSNTP
LKLRPVSTKE PITSLSELLG LPPETLDLDI ALSKDETTSS LPELLDELGD LFNNLAEAEL
KIDLTPEEFS SDLSISSMFS EETIDYAILK TDAKIEVSSE LKIRTNITEF ASFLDIDTDS
SIVFTPRTPK WAYVSWYVSE THKEVLRKKG GRLLAVRLYD ATDIDLSYQT PQLVQQYECE
EATCDRYIDI PTSNRDYITE IGYTTDNNCW LGIARSGTIR IFNPPSEDFW FVTDTELVIH
GSTEPGAKVT IDDHEIEIQP DGTFNFRVPF SNSLLQYLMT ATAARGEQTI TILKKFSQEN
PED
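The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below is an illustration under stated assumptions, not the pipeline that produced this record: it uses the standard amino acid assignments, which translation table 11 shares, and it does not model the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here). The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are the standard ones,
# which translation table 11 shares. '*' marks a stop codon.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate seq in frame 0, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listing.
print(translate("ATGTGGCAAC AAGAAAAAAA AGATGGTGTA ATCGTTAATC TGGTATTATC GCTGGCCTTA"))
print(translate("CCAGAAGATT AA"))
```

The first call yields the opening 20 residues of the protein listing (MWQQEKKDGV IVNLVLSLAL) and the second yields its final three residues (PED), with the trailing TAA read as the stop codon.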