Gene Aazo_2221 details


Gene Information

Locus tag: Aazo_2221
Symbol:
ID: 9340020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2309909
End bp: 2311708
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 44%
IMG OID:
Product: aspartate kinase
Protein accession: YP_003721340
Protein GI: 298491163
COG category:
COG ID:
TIGRFAM ID:
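
The start, end, gene length, and protein length fields above agree with each other. The short Python sketch below walks through the arithmetic. It assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length. The record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2309909
end_bp = 2311708

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# One codon per three bases. Assumption: the final codon is a stop codon
# and therefore does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1800 0 599
```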


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.175544
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCCTTA TAGTTCAGAA ATACGGTGGT TCATCTGTTG GTTCTGTAGA ACGTATCCAA 
GCGGTTGCAA AGCGGGTTTA CAAAACTGTG CAAGCAGGAA ATTCCGTTGT CGTAGTCGTT
TCCGCGATGG GAAAAACCAC CGATGGACTG GTGAAACTAG CCAATGAAAT CTCTAAAAGT
CCTAACCGTC GGGAAATGGA TATGCTGCTT TCCACTGGGG AGCAAGTCAC GATCGCTTTA
TTGAGTATGG CTTTGCAGGA AATTGGACAA GCGGCAATTT CTATGACTGG CGCTCAGGTA
GGAATTGTTA CCGAAGCTGA ACACACCCGC GCTCGGATTT TGCATATTGA AACAGGGCGT
TTGATGGGGC AGATTAATTT AGGTAAAGTG GTTGTTGTGG CTGGATTCCA AGGTATTTCT
AGCGCTAGAA AAATGGAAAT TACTACTTTG GGACGTGGTG GTTCTGACAC TTCTGCAGTA
GCTTTGGCAG CGGCATTAGG GGCAAATTTC TGTGAAATTT ATACAGATGT ACCAGGTATT
TTAACTACAG ACCCCCGCCT TGTCCCAGAA GCCCAGTTGA TGACAGAAAT CACCTGTGAT
GAAATGCTGG AACTAGCTAG TTTAGGTGCG AAAGTATTAC ATCCCCGTGC GGTGGAAATA
GCGAGGAATT ATGGTGTGCC TTTGGTGGTC CGCTCAAGTT GGACCGATCA ACCAGGGACT
TGGGTCACAA CTTCCAGAAC TCAAGAGCGA TCGCTCGTCA ATTTAGAATT AGCTCGTCCT
GTGGATGCGA TAGAATTTGA TATAAACCAG GCTAAATTCT CTTTGCTGCG TGTACCAGAT
AAGCCGGGAG TGGCAGCGCG GTTATTTGGG GAAATTTCCC GGCAAAATGT TGATGTAGAT
TTGATTATTC AGTCAATTCA TGAAGGTAAT ACTAATGATA TTGCTTTCAC AGTAAATACA
CATATATTAA AACGCGATGA AGCTATAGCA GCCTCTATTG CCCCGGCTTT GAGAAGTCAA
CCTAATTCAG ATGAAGCTGA AGTTTTAGTA GAAAGTAATA CAGCCAAAGT GAGTATTTCT
GGGGCGGGAA TGATTGGCCG TCCTGGTGTG GCTGCCAAGA TATTTGCTAC CTTAGCGCAA
GCTAAAGTAA ATATTCAAAT GATTTCTACC AGTGAAGTGA AAGTGAGTTG CTTGGTAGAT
GCGACAGATT GCGATCGCGC TATTGTTGCT CTCTGTAACG CTTTTGAAAT TACTGCTTCC
CCCGCTGTCC TTGCTTCCCC AACTCCTGAA TCTCCTGCTG TGCGTGGTGT TGCTTTAGAT
ATGAATCAAG CGCGGTTAGC AATTCGCCAA GTTCCAGATC AACCAGGGAT GGCTGCAAAA
TTGTTTGGAT TATTGGCAGA ATATAACATC AGCGTGGATA TGATTATTCA GTCCCAACGC
TGCCGGGTAG TGGATGGTGT AACACGTCGG GATATTGCCT TTACTGTGGC TAGGATGGAT
GTAGAAAACG CCCAACAAAA ATTAACCCAA GTAGCAGATG AACTAGGATG GGGTGAAGTA
GTTTTAGATA ATGCGATCGC CAAAGTCAGT ATCGTTGGTT CTGGGATGGT AGGACAACCA
GGTATTGCAG CCAAAATGTT TACAGCTTTA GCAGAAAATA AAATTAACAT CCAAATGATT
GCTACTTCGG AAATTAAAAT TAGTTGTGTT GTGGGACAAG ATGAAGGTGT CAAAGCTTTA
CAAGTCATTC ATACAGCTTT TGATTTAGCT GGTAGTGAAA AATTTGTAGT CCCAGTGTGA
 
Protein sequence
MALIVQKYGG SSVGSVERIQ AVAKRVYKTV QAGNSVVVVV SAMGKTTDGL VKLANEISKS 
PNRREMDMLL STGEQVTIAL LSMALQEIGQ AAISMTGAQV GIVTEAEHTR ARILHIETGR
LMGQINLGKV VVVAGFQGIS SARKMEITTL GRGGSDTSAV ALAAALGANF CEIYTDVPGI
LTTDPRLVPE AQLMTEITCD EMLELASLGA KVLHPRAVEI ARNYGVPLVV RSSWTDQPGT
WVTTSRTQER SLVNLELARP VDAIEFDINQ AKFSLLRVPD KPGVAARLFG EISRQNVDVD
LIIQSIHEGN TNDIAFTVNT HILKRDEAIA ASIAPALRSQ PNSDEAEVLV ESNTAKVSIS
GAGMIGRPGV AAKIFATLAQ AKVNIQMIST SEVKVSCLVD ATDCDRAIVA LCNAFEITAS
PAVLASPTPE SPAVRGVALD MNQARLAIRQ VPDQPGMAAK LFGLLAEYNI SVDMIIQSQR
CRVVDGVTRR DIAFTVARMD VENAQQKLTQ VADELGWGEV VLDNAIAKVS IVGSGMVGQP
GIAAKMFTAL AENKINIQMI ATSEIKISCV VGQDEGVKAL QVIHTAFDLA GSEKFVVPV
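
The gene and protein sequences above are printed in blocks of ten characters. The following Python sketch shows one way to strip that layout, compute GC content, and translate the gene for comparison with the listed protein. It is a minimal illustration and not the database's own tooling. The function names are hypothetical. The codon table is the standard one, which translation table 11 is understood to share for elongation codons (the tables differ in permitted start codons, which this sketch ignores).

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean_sequence(text)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first 21 bases of the gene sequence above.
print(translate("ATGGCCCTTA TAGTTCAGAA A"))  # MALIVQK
```

Passing the full gene sequence to `translate` and the full protein listing to `clean_sequence` gives two strings that can be compared directly.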