Gene Aazo_3522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3522 
Symbol 
ID9341328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3589099 
End bp3590625 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content39% 
IMG OID 
Productadenylylsulfate kinase 
Protein accessionYP_003722253 
Protein GI298492076 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGAGA CCAATCTTCC AGATTTAATT CAGCAAATGT TAAAACCTGG ATTCTATTCT 
CATCCAGTCA CAGAACCTAT TGAATTAATA CAAACTCACG TTTCTTATGT GCTACTAACT
GGGGATTATG CATACAAACT GAAAAAACCT GTAAATTTTG GCTTTTTAGA CTACTCCACC
TTAGAAAGGC GACAACATTT TTGTCACCAA GAGTTACGCT TAAATGAACG AGGAGCAGGT
GAACTATACT TGGAAGTTTT ACCTGTGACT TTAGAAGGAG AAAAATATCA TTTAGGAGGT
ACAGGAGAGG CAGTCGAATA TGCCCTGAAA ATGCGGCAAT TTCCCCAAGA AGCCCTCTTC
AGTGAACTTC TTGCCCAAAA CAAATTAAAT GAGACTGATT TGGAAGAATT GGGACGATTG
GTAGCTGAAT ATCATGCTCA AGCCCAGACA AATGATCACA TCCGGAGCTT TGGTGAAGTA
TCACAAGTGC GGTTGGCTAT TGATGAAAAT TACGAACAAA CCCTAAAGTA TATTGGTGCT
CCTCAGACAC AGGAACGCTT TGCACAAACA AAAGCATATA CAGATAACTT TTTTGACCAA
CGTCCAGAAT TATTTGTTAT TAGAATTGTC AATAACTATA TTCGTGAATG TCATGGGGAT
TTACACCTGA GAAATATTGC TCGATGGCAC GACAAAATTA TGGTGTTTGA CTGCATAGAG
TTCAATGAGC CATTTCGCTT TGTGGATGTC ATGTATGATG TGGCATTTAC GGTAATGGAT
ATAGAAGCAA AAGGCCGAAA AGATTTAGGT AATGCCTTTT TGAATACTTA TGCAGAACAA
ACTGGTGATT GGGAAGGTTT ACAGGTTTTA CCTTTGTATT TAAGCCGTCA AGCCTATGTC
CGGGCTAAAG TGACTTCGTT TTTATTAGAT GATCCAAATT TACCCGCGAG TGTGAAGGAA
GAAGCAGGAA AGACTGCATC TGCGTATTAT TGCCAAGCAT GGGATTATAC TATACCTAAG
CAGGGTCAAG TGATTTTGAT GTCAGGGTTA TCTGGTTCTG GTAAAAGTAC AACAGCTAGA
TATTTGGCTC GGAAATTAGG TGCAGTTCAT CTCCGTTCTG ATGCCGTCAG AAAACATTTG
GCTGGTATTC CTCTGCTAGA ACGTGGTGGT GATAAAATAT ATACGCCAGA GATGACACAG
AAGACTTATA ACAGGTTGTT GACATTAGGG ATTTCACTTG CTAATCAAGG TTGGTGTGTA
ATTTTAGATG CTAAGTATGA TCGTCAACAT TTGCGACAGG AGGTGATAAC ACAAACACAA
CAAAATCAAT TACCTTTACA GATTATCCAC TGCACTGCAC CAATAGAAGT ATTACAAGAG
CGTCTTGTTA ACCGTACTGG TGATATTGCT GATGCTACTG CGGATTTATT GGTATCTCAA
ATTAAACAAG CTGAACCTTT TACAAATCAG GAACAACCAT ACCTGAAAAT ATTGGATACG
ACTCAATCAT TGGAAGCACA ATTATAG
 
Protein sequence
MTETNLPDLI QQMLKPGFYS HPVTEPIELI QTHVSYVLLT GDYAYKLKKP VNFGFLDYST 
LERRQHFCHQ ELRLNERGAG ELYLEVLPVT LEGEKYHLGG TGEAVEYALK MRQFPQEALF
SELLAQNKLN ETDLEELGRL VAEYHAQAQT NDHIRSFGEV SQVRLAIDEN YEQTLKYIGA
PQTQERFAQT KAYTDNFFDQ RPELFVIRIV NNYIRECHGD LHLRNIARWH DKIMVFDCIE
FNEPFRFVDV MYDVAFTVMD IEAKGRKDLG NAFLNTYAEQ TGDWEGLQVL PLYLSRQAYV
RAKVTSFLLD DPNLPASVKE EAGKTASAYY CQAWDYTIPK QGQVILMSGL SGSGKSTTAR
YLARKLGAVH LRSDAVRKHL AGIPLLERGG DKIYTPEMTQ KTYNRLLTLG ISLANQGWCV
ILDAKYDRQH LRQEVITQTQ QNQLPLQIIH CTAPIEVLQE RLVNRTGDIA DATADLLVSQ
IKQAEPFTNQ EQPYLKILDT TQSLEAQL