Gene Aazo_0856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0856 
Symbol 
ID9338644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp911456 
End bp912646 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content44% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003720394 
Protein GI298490217 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.435547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCAG CCGCAACTGC AACTACTCAG GGAGACAAAT CTATCGGCGT GGAGGTTATA 
TTTCAATTTC TATTGAAAGA ACTACAGCAG TCAACCAAGG CTACTCACAA GAATTGCCGC
GACGTGGCAT TGCGAATTAC GGGTGAAGTC CTGCGGATTT GCAATGAAAG TAAACGCATT
CAAGCTTCTG GTGACATAGA AAGCTCTGCC ATGACCCTAG CTCGACATCG GCTACAACAA
TGCCTTCGGT ATTATCAGCT GGGGTCGAAT CGTAGCAGGG TGGAACTACA CAGTACATTA
AGTGCCATAA TTTATCGTTA CATTAATCCT CCTCAGAAGC AATTGAGCTA TCAAGGGCGG
CTGACTATCA TAGAAGATTT TCTTCAGAAT TTTTATCTGG AGGCACTAAA TGCTTTTCGT
AGGGAGAACC AACAAGAACC TACCTATCGC CCCCAAACCC TACTAGAATT AGCCGAGTAC
ATGGCATTTA CAGAACGCTA TGGCAAACGT CGTATTCCCT TACCCGGCCG TCAGCAACAG
CTAATTATTC TGCGAGCGCA AACTTTTTCG CAACAACAGC CCCCAGAAAC TAGTGTGGAT
ATAGAGCAAG CGGCCGAAGG CAGTGGTAAT GAAACTGACA GTTTTTGGGA AGAACCAGCT
GTGCAACAAT TGCGTTCTGC TATGGCTATG CAAGCAGAAG CCGAACCGGA AGAAGATACT
TTGCGTTCTG TTGTAGTTAC CGAATTAATG AATTATCTCG AACAAAGACA ACAATCGGAC
TGTGCTGATT ATTTCTCTCT CCGTCTCCAG GATTTATCAG CACAAGAGAT TGAGTCAGTT
TTGGGTTTAA CTCCGCGTCA ACGAGATTAC TTACAACAGC GTTTTAAGTA TCATTTAATT
CGGTTTGCAC TTTTGCATCG TTGGGAATTA GTTCATGAAT GGTTGGAAGT TTCTTTACCC
ACTAATTTAG GATTAACTCC CCACCAATGG CAAGTCTACA CAGGAAAGTT GGACGATAAA
CAGCGGTCTT TGTTAGATTT GAAGCAACAA GGACAGTCTG ACGAAGCTAT TGCCAAAACT
TTAGGCTTGT CAATGGCACA ACTGCAAAAA CGGTGGTTTA AGATTTTGGA ACAAGCTTGG
GAAATTCGTA ACTCATTTGT GTCCGGATCT GGTGCATCTA CCCATGAATA G
 
Protein sequence
MNSAATATTQ GDKSIGVEVI FQFLLKELQQ STKATHKNCR DVALRITGEV LRICNESKRI 
QASGDIESSA MTLARHRLQQ CLRYYQLGSN RSRVELHSTL SAIIYRYINP PQKQLSYQGR
LTIIEDFLQN FYLEALNAFR RENQQEPTYR PQTLLELAEY MAFTERYGKR RIPLPGRQQQ
LIILRAQTFS QQQPPETSVD IEQAAEGSGN ETDSFWEEPA VQQLRSAMAM QAEAEPEEDT
LRSVVVTELM NYLEQRQQSD CADYFSLRLQ DLSAQEIESV LGLTPRQRDY LQQRFKYHLI
RFALLHRWEL VHEWLEVSLP TNLGLTPHQW QVYTGKLDDK QRSLLDLKQQ GQSDEAIAKT
LGLSMAQLQK RWFKILEQAW EIRNSFVSGS GASTHE