Gene Aazo_3874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3874 
Symbol 
ID9341678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3927270 
End bp3928436 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content39% 
IMG OID 
ProductNHL repeat containing protein 
Protein accessionYP_003722506 
Protein GI298492329 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAATC ATCTGACTCA AGATATAGAC CCTTTATCCA TATTTCCCAA CGGTGCAAAA 
ATAATATTAG GAAACAATAT TACATCCGAA CAAATAGCCA TACCTCTTGC ACCAAGTCCA
ACAACAATGT TTGGACCCCG TGCTGCTTGT TTATTATCAC CAACTGGACC ATTATGGGTA
TCAGATACAG GACATCATCG TTTATTAGGT TGGCGAAATT TACCCACAGA AGATAATCAA
CCGGCTGATT GGGTAATAGG ACAACCTGAC TTTTATCATG AAGGACAAAA CGCCAAAGGT
ACACCTGGAA AATCTACAGT TAGTGTCCCT ACAAGTATTT GTAAATGTGG TGCAGGTTTA
GCTGTTGCTG ATGCTTGGAA TCATCGGGTC TTAATTTGGC ATAATGTACC GGAAGATAGC
AATTTTCCCG CAGATTTAGT ATTAGGACAA GCTAATTTTA CCGATAACGA ACATAACCAA
GGTAGTCAAC AACCTGCGGC AAATACTTTA CATTGGCCCT ATGGTGTTTT CTATCATCAA
GGTAAGTTAT TTGTAGCCGA TACTGGAAAT CGCCGTTTGT TAATTTGGAA TCAATTTCCT
ACAGAAAATG GACAACCAGC GGATATAGTT TTGGGACAAC CAGACATGAT ATTTCGTAAT
GAAAATGGTG GTGGTTCTCC CACTGCTTCT AGTATGCGCT GGTGTCATGA TATTACTCTT
TGGGATAATA ATTTAGTTGT CACCGATGCG GGTAATAACC GGGTGATGAT TTGGGATGGT
ATACCGACAG AAAATAATGC CCCTTGTGCG GTGGTCTTAG GTCAGAAAAA CTTCAATTTT
GTGGAATTAA ATCAAGGTGT ATATTTTCCT ACTGGCAGTA GCCTAAGTAT GCCTTATGGG
GTAGATACTG CGGGAGATTG GTTAATAGTT GCAGATACGG CTAATTCTCG TTTGCTAGGA
TGGAAGAAAC GAGAATCTAT TTTGTTATTA CAGGGTGGAT ATGCTGATGG TGTAGTGGGA
CAAGATAGTT TTAAAAGTAA GAGTGAAAAT CAGAATTTTG GACCGCCAAC GCGACGAAGT
TTAAATTGGT GCTATGGAAT TAAAGTTTGT GGTGAAATTG CGGTAATTTC TGATTCTGGC
AATAATCGAG TTTTGATTTG GAGATGA
 
Protein sequence
MLNHLTQDID PLSIFPNGAK IILGNNITSE QIAIPLAPSP TTMFGPRAAC LLSPTGPLWV 
SDTGHHRLLG WRNLPTEDNQ PADWVIGQPD FYHEGQNAKG TPGKSTVSVP TSICKCGAGL
AVADAWNHRV LIWHNVPEDS NFPADLVLGQ ANFTDNEHNQ GSQQPAANTL HWPYGVFYHQ
GKLFVADTGN RRLLIWNQFP TENGQPADIV LGQPDMIFRN ENGGGSPTAS SMRWCHDITL
WDNNLVVTDA GNNRVMIWDG IPTENNAPCA VVLGQKNFNF VELNQGVYFP TGSSLSMPYG
VDTAGDWLIV ADTANSRLLG WKKRESILLL QGGYADGVVG QDSFKSKSEN QNFGPPTRRS
LNWCYGIKVC GEIAVISDSG NNRVLIWR