Gene Aazo_4395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4395 
Symbol 
ID9342199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4473412 
End bp4475319 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content44% 
IMG OID 
Productdeoxyxylulose-5-phosphate synthase 
Protein accessionYP_003722838 
Protein GI298492661 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.326234 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCTGA GTGAAATAAC CCATCCTAAC CAGTTACATG GTTTGTCTAT TCGGCAATTG 
CAACAAATTG CCCGTCAAAT CCGAGATAAG CACCTACAAA CCGTAGCAGC GACTGGAGGG
CATTTGGGGC CAGGTTTGGG AGTTGTAGAA TTAACTCTAG GGCTTTACCA GACATTAGAC
TTAGATCGGG ACAAAGTTAT TTGGGATGTG GGACATCAAG CTTATCCCCA CAAATTAATC
ACCGGACGTT ACAGTAACTT CCACACTCTC AGACAAAAAG ACGGAGTAGC CGGTTATCTC
AAACGCTGCG AAAACAAGTT TGACCACTTT GGCGCAGGAC ACGCTTCAAC CAGTATCTCA
GCCGCTTTAG GCATGGCTTT AGCCCGTGAC TTAAAAGGGG AAAAATTTAA ATCCGTTGCA
GTCATTGGAG ATGGCGCTTT AACTGGTGGT ATGGCGCTCG AAGCTATCAA CCATGCAGGA
CACTTACCAA AAACTAACCT GCTCGTGGTT CTCAATGACA ACGAAATGTC AATTTCTCCC
AACGTTGGAG CGATTCCTCG TTATCTTAAT AAAATGCGTC TGAGTCCGCC GGTGCAGTTT
CTTTCAGATG GCATTGAGGA ACAGCTAAAA CATATTCCTT TCGTTGGTGA ATCTATTTCC
CCAGAACTGG AACGCATTAA AGAAGGAATG AAGCGGTTAG CAGTTCCCAA AGTGGGTGCA
GTTTTTGAAG AACTGGGCTT TACCTACATG GGACCAATGG ATGGGCATAA TTTAGAGGAG
TTGATTGCTA CATTCCAACA GGCACATAAA ATTACTGGTC CTGTTTTAGT TCACGTAGTT
ACAACTAAAG GTAAAGGGTA TGAACTAGCT GAAAAGGATC AAGTAGGCTA CCATGCTCAA
AACCCATTTA ATTTAGCCAC TGGTAAAGCT ATACCTTCCA GCAAGCCCAA ACCACCTGCT
TACGCAAAAG TCTTTTCTCA CACCTTAGTC AAACTAGCCG AACAGAACCC GAAAATTGTT
GGTATTACGG CAGCAATGGC CACAGGTACA GGTTTAGATA AACTACAAGC CAAACTTCCA
AATCAATATA TTGATGTGGG TATTGCAGAA CAACACGCTG TTACTCTCGC TGCGGGATTA
GCCGCTGAAG GTATGCGTCC CGTTGCGGCT ATTTATTCCA CCTTCTTACA ACGTGCCTAC
GACCAAATTA TTCATGATGT CTGCATCCAA AACCTACCTG TGTTCTTCTG TTTAGACAGG
TCCGGTATAG TTGGTGCTGA TGGGCCAACT CACCAAGGTA TGTATGATAT TGCTTATATG
CGATGTATTC CTAACATGGT AGTAATGGCT CCCAAGGATG AAGCTGAACT ACAACGCATG
GTAGTGACAG GTATTAACCA TACCACTAGC CCCATTTCTA TGCGCTTCCC CCGTGGTAAT
GGTCACGGTG TACCTTTAAT GGAAGAAGGT TGGGAACCTT TGGAAATTGG TAAAGGAGAA
ATTCTCCGTC AAGGTGATGA TGTGTTAATT CTTGGCTATG GCACAATGGT TTATCCAAGT
ATGCAAGCAG CAGAAATACT CAGCGAACAT GGCATTGAAG CAACAGTGAT TAATGCCCGT
TTCGTTAAGC CTTTAGACAC AGAGTTGATT GTACCTTTGG CTAAACAAAT CGGCCGGGTT
GTTTCTTTAG AAGAAGGCTG TTTAATGGGT GGCTTTGGTT CTGCGGTGGC TGAAGCTTTA
ATGGATGCTA ATGTTTTAGT TCCAGTGAAG CGAATTGGTG TACCAGATAT TTTGGTAGAT
CATGCTACTC CTGATGAATC TTTTGCAGTG TTAGGTTTGA GTAGTCGTCA AATTGTGGAA
ACTGTTTTGC AGGCTTTCTT CAAAAAAGAA TTAGCTGTTG TGAAATAA
 
Protein sequence
MHLSEITHPN QLHGLSIRQL QQIARQIRDK HLQTVAATGG HLGPGLGVVE LTLGLYQTLD 
LDRDKVIWDV GHQAYPHKLI TGRYSNFHTL RQKDGVAGYL KRCENKFDHF GAGHASTSIS
AALGMALARD LKGEKFKSVA VIGDGALTGG MALEAINHAG HLPKTNLLVV LNDNEMSISP
NVGAIPRYLN KMRLSPPVQF LSDGIEEQLK HIPFVGESIS PELERIKEGM KRLAVPKVGA
VFEELGFTYM GPMDGHNLEE LIATFQQAHK ITGPVLVHVV TTKGKGYELA EKDQVGYHAQ
NPFNLATGKA IPSSKPKPPA YAKVFSHTLV KLAEQNPKIV GITAAMATGT GLDKLQAKLP
NQYIDVGIAE QHAVTLAAGL AAEGMRPVAA IYSTFLQRAY DQIIHDVCIQ NLPVFFCLDR
SGIVGADGPT HQGMYDIAYM RCIPNMVVMA PKDEAELQRM VVTGINHTTS PISMRFPRGN
GHGVPLMEEG WEPLEIGKGE ILRQGDDVLI LGYGTMVYPS MQAAEILSEH GIEATVINAR
FVKPLDTELI VPLAKQIGRV VSLEEGCLMG GFGSAVAEAL MDANVLVPVK RIGVPDILVD
HATPDESFAV LGLSSRQIVE TVLQAFFKKE LAVVK