Gene Aazo_1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1971 
Symbol 
ID9339764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2051434 
End bp2053446 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content42% 
IMG OID 
Producttype II secretion system protein E 
Protein accessionYP_003721172 
Protein GI298490995 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTACT CGTCACCACA ACGGACTAGC ACCGCGTTAA CTACAAAAAC ACAGTTTTCG 
CCCTTTGGCA ACCACCTAGT GCAATCTGGC TATGTCAACA CTGAACAGAT GAAACAGGCA
CTAATAGAGA GCCGGAAATC TGGCAAGAAG CTGATAGATG TCCTAGAGTT AATTACTGGG
CAACAACTAT CAGCAGAGTT TATTCGGGAA CACAAGAAAC AACATCTATT TGAACTAAAA
ATATTATATG GCGTTGAATC AGTAGATCCA GAAGTCAATC AAATTGGCAA TCTGATGGTT
GGACAATTGA TTGATAGTCT GATACCAGTC GATGTCTGTC GTCGTCATCG TTTGGTACCG
CTATGGAAAC GGGAAGACCA AATACCACCT TATGTTTTAG TGGCAATGGT GGAACCAGAT
AATCTGGATG CTTCTGATGA CCTGAACCGA ATCTTGCGTA ACAAAAACTT ATCCTTACAG
CGGATGGTGA TTACCCAAGA AGATTACCAG CATCTCATCA ACAAATATTT AGATGAGTTG
GCTATTCGGG AAAAGGAAAA AGAACAAAAA AGAGATACTG ATATTAATCA GGATATAAAA
TATCTGGAAG ATCAGGATAT AGAAGAATTT GGTGAGGACA AGGATGCTGA TATCAATGCA
TCAATTAAGG ATGCTGAAGA TGCACCCATT ATCAAACTCG TCAACAGAAT CCTGTTTAAA
GCTTTGCAAG AAAAGGTTTC GGATATTCAT ATCGAACCAC AAGAGGAAAA CTTACGCATT
CGCTTCCGTA AGGATGGGGT ACTGCATGAG GCTTTTGACC CTGTACCTAA AAAAATCATT
CCGGCGGTGA CAGCCCGATT TAAAATCATT TCCAATCTTG ACATCGCGGA AAGGCGTTTA
CCTCAAGATG GACGCATCCG TCGTTTATTT GAAGGACGGA AAGTTGACTT CCGGGTGAAT
ACCTTACCCA GTCGCTATGG GGAAAAGGTG GTGTTACGGA TTCTGGACAA CTCTTCTACC
CAACTAGGGT TGAATAAGTT AATTACTGAT CCAGAGACTT TGCATATTGT CCAGGATATC
GTCAGCAAGC CCTTTGGCTT GATTTTGGTA ACTGGTCCGA CTGGTTCTGG TAAAACGACT
TCGTTGTATT CGGCATTGGC AGAAAAGAAC GATCCAGGAA TTAACATTAG TACGGTAGAA
GACCCGATTG AATATAGTTT GCCAGGGATT ACCCAAGTAC AGGTAATTCG GGAAAAAGGG
TTGGATTTTT CCACAGCATT GCGGGCGTTC TTGCGTCAAG ATCCCGATGT GTTGCTTGTG
GGTGAAACAC GGGATAAAGA AACGGCAAAA ACAGCGATTG AAGCGGCTTT AACAGGTCAC
TTAGTATTAA CTACCTTACA CACTAATGAT GCGCCTGGTG CGATCGCACG TCTAGGAGAA
ATGGGAATTG AACCCTTCAT GGTTTCCAGT TCCCTAATTG GTGTTTTGGC ACAACGTTTA
GTGCGTCGAG TTTGTTCACA ATGTCGAATT CCCTACACTC CCACTACTGA AGAACTGGCT
CGTTATGGTC TGTCAGCATC CAAAGAAACA TCAGTAACTT TTTACAAGGC TAATTCCTTA
CCTCCAGAAT CTATAGCAGA AGCTAAAACC AAAAATGAAA TCTGTGGGGC TTGTAATGGC
ATTGGGTATA AAGGACGTTG TGGTGTTTAT GAAGTTATGC GAGTCACAGA AAACCTCCAA
ACTCTCATCA ACCAAGAAGC ACCCACAGAA CGCATTAAAG AAGTAGCTGT AGAAGAAGGT
ATGAAAACCT TGCTTGCTTA CAGTCTAGAT TTAGTGCGTC AAGGTTCTAC AACTCTGGAG
GAAGTAGAGC GCGTAACCTT TACTGATACA GGTTTGGAAG CTGAATTAAA AGCCAAACGT
AAGAGTGGTC TAACTTGTAA AACTTGTGAT GCCACATTAC AACCAGAATG GCTCGATTGT
CCCTACTGTA TGACACCTCG TTTTCAAGAT TAG
 
Protein sequence
MTYSSPQRTS TALTTKTQFS PFGNHLVQSG YVNTEQMKQA LIESRKSGKK LIDVLELITG 
QQLSAEFIRE HKKQHLFELK ILYGVESVDP EVNQIGNLMV GQLIDSLIPV DVCRRHRLVP
LWKREDQIPP YVLVAMVEPD NLDASDDLNR ILRNKNLSLQ RMVITQEDYQ HLINKYLDEL
AIREKEKEQK RDTDINQDIK YLEDQDIEEF GEDKDADINA SIKDAEDAPI IKLVNRILFK
ALQEKVSDIH IEPQEENLRI RFRKDGVLHE AFDPVPKKII PAVTARFKII SNLDIAERRL
PQDGRIRRLF EGRKVDFRVN TLPSRYGEKV VLRILDNSST QLGLNKLITD PETLHIVQDI
VSKPFGLILV TGPTGSGKTT SLYSALAEKN DPGINISTVE DPIEYSLPGI TQVQVIREKG
LDFSTALRAF LRQDPDVLLV GETRDKETAK TAIEAALTGH LVLTTLHTND APGAIARLGE
MGIEPFMVSS SLIGVLAQRL VRRVCSQCRI PYTPTTEELA RYGLSASKET SVTFYKANSL
PPESIAEAKT KNEICGACNG IGYKGRCGVY EVMRVTENLQ TLINQEAPTE RIKEVAVEEG
MKTLLAYSLD LVRQGSTTLE EVERVTFTDT GLEAELKAKR KSGLTCKTCD ATLQPEWLDC
PYCMTPRFQD