Gene Aazo_0739 details

Gene Information

Locus tag: Aazo_0739
Symbol:
ID: 9338525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 780888
End bp: 782507
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: processing peptidase
Protein accession: YP_003720314
Protein GI: 298490137
COG category:
COG ID:
TIGRFAM ID:
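
The length fields in this record are consistent with the coordinates. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Amino acid count of an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(780888, 782507)
print(length)                  # 1620
print(protein_length(length))  # 539
```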


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTATT CAATTTTGGA TTCCTGGAAG AGTAAAACTA ACAAGTCTCA ATTTAGATAC 
TCTCAATTAT TCCACAAATT TCCTTTCTCT CCAGTTTTGC ATCGTTTGTT AATAACTGCT
TTGGCGGTAA TGATATTTTG TTTGGGTGTA ACACCACAAG CTGCAATCGC ACTGGAACCT
ACCCAAAGTT TAATTCAACC ATATTTAGAT CAAGTAGTTA AGCAATTAAC AGAATTTCGC
CTTGATAATG GCTTGAAGTT CATTGTTTTG GAAAGACACC AAGTGCCAGT GGTTTCTTTT
TTGACTTATG CTGATGTGGG TGGGATAGAT GAATCAGATG GTAAAACAGG TGTAGCTCAC
TTTTTAGAAC ATTTGGCTTT TAAAGGGACA AAACGTATTG GGACAATAGA TTACAAAGCC
GAAAAACCGC TACTAGAAAG TTTAGAACAG TTAGATACCC AAATTAGAAC TGCTAAATCC
CAAGGTAAAA ATGATGATTT AGCAAAGTTA GAAACTGAGT TTAAATCATT AGAATCTCAA
GCACTGAAAC TAGTCAAACA GAACGAAATC GGGCAAATTG TTGAGCAAGC GGGGGGTGTA
GGTTTAAATG CGAATACTTC CACCGAAGCT ACACGTTATT TCTACAGTTT TCCGGCTAAT
AAGTTGGAAC TGTGGATGTC TCTGGAGTCA GAAAGATTTT TAGATCCTGT ATTTCGGGAG
TTTTATAAAG AAAAAGATGT GATTTTGGAA GAAAGACGGA TGCGGGTAGA AAACTCTCCC
GTTGGTTTGA TGGTTGAGAA ATTTATTGAT ACTGCTTTCA AAGTTCATCC CTACAGACGA
CCAGTGATTG GTTATGACGA AGATATTCGC AATTTGTCAC CGAAAGATGT GAAGCAGTTT
TTTGACAGTT ATTATGTTCC TAACAATCTA GTGATCGCTA TTGTCGGTGA TGTTAAGCCT
AATGAGGTGA AAAAATTAGC ACAAGTTTAC TTTGGACGTT ATCCGGCTAA ACTTAAAGCC
CAAGCAAAAA TCACCCCCGA ACCTAAGCAA ACTGAACCCA GAGAATTTAC CTTAAAGTTA
CCTACTCAAC CTTGGTATTT CCAAGGTTAT CACCGTCCAG GTATCACCCA CCCCGATAAT
GCAGTTTACG ACATTATTGG TAGTTTATTG AGTGATGGGC GCACTTCACG ATTGTATAAG
TCTTTGGTGG AAAAGCAGAG TTTAGCCCTT GCGGCTCACG GTGTAAGTGG GTTTCCTGGT
GATAAATATC CCAATTTGAT CTTATTTTAT GCTCTCACAT CTCCAGGTCA TACAGTTGAT
GATTTGGCAA TCGCACTGGG AAAAGAAATC GAAAAGTTGA AAACTGAGCC TGTCTCCACA
ACTGAGTTAC AGCGAGTGAA AACTCAAGCG CGGGCTGGTT TGTTACGTAG TCTTGATTCC
AATATGGGTA TGGCGCAGCA GTTATTGGAA TATGAAGTGA AAACCGGCTC TTGGCGGAAT
TTATTTAAGC AGTTAGAGAA TATTGCAGCA GTGACACCTG CTGATATTCA GCGTGTAGCA
CAGGTGACTT TTACCCGGGA AAATTGCACT GTTGGTAAGT TGTTGTCGAA GGAAGGATAG
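
The gene sequence above is printed in space-separated blocks of 10 bases. A small sketch of how such a listing could be joined into one string and its GC fraction computed follows. The helper names are hypothetical, and the exact method behind the GC content value reported in this record is not stated, so this is only one plausible way to obtain a comparable figure.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# The first two blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGAATTATT CAATTTTGGA")
print(fragment)  # ATGAATTATTCAATTTTGGA
print(gc_fraction(fragment))
```

Pasting the full listing into `clean_sequence` and multiplying the result of `gc_fraction` by 100 gives a percentage to compare with the GC content field.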
 
Protein sequence
MNYSILDSWK SKTNKSQFRY SQLFHKFPFS PVLHRLLITA LAVMIFCLGV TPQAAIALEP 
TQSLIQPYLD QVVKQLTEFR LDNGLKFIVL ERHQVPVVSF LTYADVGGID ESDGKTGVAH
FLEHLAFKGT KRIGTIDYKA EKPLLESLEQ LDTQIRTAKS QGKNDDLAKL ETEFKSLESQ
ALKLVKQNEI GQIVEQAGGV GLNANTSTEA TRYFYSFPAN KLELWMSLES ERFLDPVFRE
FYKEKDVILE ERRMRVENSP VGLMVEKFID TAFKVHPYRR PVIGYDEDIR NLSPKDVKQF
FDSYYVPNNL VIAIVGDVKP NEVKKLAQVY FGRYPAKLKA QAKITPEPKQ TEPREFTLKL
PTQPWYFQGY HRPGITHPDN AVYDIIGSLL SDGRTSRLYK SLVEKQSLAL AAHGVSGFPG
DKYPNLILFY ALTSPGHTVD DLAIALGKEI EKLKTEPVST TELQRVKTQA RAGLLRSLDS
NMGMAQQLLE YEVKTGSWRN LFKQLENIAA VTPADIQRVA QVTFTRENCT VGKLLSKEG
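
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds a codon table from the NCBI ordering of the standard code, whose codon-to-amino-acid assignments table 11 shares, and translates up to the first stop codon. It assumes the sequence starts with ATG, as this gene does; table 11 also permits alternative start codons, which this sketch does not handle. All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate the block-formatted listing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first three blocks of the gene sequence give the first block of the protein.
print(translate("ATGAATTATT CAATTTTGGA TTCCTGGAAG"))  # MNYSILDSWK
```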