Gene Aazo_3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3801 
Symbol 
ID9341606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3858134 
End bp3859600 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content38% 
IMG OID 
ProductOrn/Lys/Arg decarboxylase major region 
Protein accessionYP_003722453 
Protein GI298492276 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCAATC AAAACCAAAC CCCCCTCATA GATGCCTTAA AAGTCTCTAT CTCTCGTCCC 
CACGCACCAT TCTACACCCC AGGACATAAA CGGGGTGCGG GAATTTCGCC TATTTTAACC
GATTTACTAG GTAAAGAAGT TTTCCGTGCT GATTTAACAG AACTTGCAGA ATTAGATCAC
CTCTTCACAC CTGAAAGCGC AATTTTAGCA GCACAAGAAT TAGCGGCAGT GGCTTTTGGT
GCGGAAAAAA CGTGGTTTTT AGTTAATGGT TCTACTTGTG GAATTGAAGC TGCAATTCTC
GCTACTTGCG GTATGAATGA TAAAATTATT CTGCCGCGAA ATGTGCATTC TTCGGTAATT
TCTGGATTAA TTCTTTCTGG TGCAATTCCT ACTTTTATAA ATCCCGAATA TGATCAAGAC
TTAGATTTTG CTCACAGTAT TACACCGGAA GCTGTAAAAA CAGCATTAGC AAAATATCCT
GATGCAAAAG CAGTGCTGAC TGTTTATCCT ACTTATTACG GTGTTTGTGG AGATTTGAGT
GCGATCGCAC AAATTACCCA TCAACATCAC ATTCCCCTCA TTGTTGATGA AGCACATGGG
GCACATTTCT CTTTTCATCC TCATTTACCC ACATCAGCTT TAACCGCAGG TGCAGATTTA
ACCATACAAT CTATTCACAA AACCTTGGGT GCAATGACAC AGGCATCAAT GTTGCACATC
CAAGGCAATA GAATTGATAT TGATAGATTA AATAAATCCT TACAGTTAGT TCAATCTACA
AGTCCCAGCT TTATTCTTTT AGCTTCCCTT GATGCCGCAC GTCAACAAAT GGCTATCAAT
GGGGAATGGT TGATGTCTCA AACTTTGCAA TTAGCTGAAG CAGCAAGAAG TCAAATTAGC
CAAATTCCTG GTTTATCAGT TTTAGAGATC CCCCCAACCC CCCTTTTTAA GGAGGGCTTT
GTGGATTTAG ATCAAACACG GTTAACTATT AATATTTCTG AATTAGGTTT AACAGGGTTT
GAAGCTGAAG AAATTCTCAA TGAAATGGGT GTTACCTCAG AATTTTCATC CCTACAAAAT
ATTACTTTTA TTATTAGTTT GGGTAATATT TGGACAGATA TAGATGCATT AGTACAAGGA
TTAAAAAATT TGACTCGGAT ACCACAATTG ACAAGTCAGT ACAAATTATG TAAATATACA
AACGATGCTA TGATTAGCCT TAATATGTGC ATTTCTCCCC GTGAGGCTTT TTTTGCTAAC
AGTGAAATAT TGCCTTTGGA GAAAACGGAA GAAAGAATTT GTGCAGAAAT TATTTGCCCA
TATCCTCCAG GAATTCCTGT ATTAATGCCG GGAGAAATCA TTACAAAATC GGCTTTAGAA
TATCTGCTAC AAATTCAGTC TTTGGGAGGA TTTATTACTG GTTGTATGGA TACAAGCCTC
CGCAGCGTAA AGGTCATCAA AACCTAA
 
Protein sequence
MLNQNQTPLI DALKVSISRP HAPFYTPGHK RGAGISPILT DLLGKEVFRA DLTELAELDH 
LFTPESAILA AQELAAVAFG AEKTWFLVNG STCGIEAAIL ATCGMNDKII LPRNVHSSVI
SGLILSGAIP TFINPEYDQD LDFAHSITPE AVKTALAKYP DAKAVLTVYP TYYGVCGDLS
AIAQITHQHH IPLIVDEAHG AHFSFHPHLP TSALTAGADL TIQSIHKTLG AMTQASMLHI
QGNRIDIDRL NKSLQLVQST SPSFILLASL DAARQQMAIN GEWLMSQTLQ LAEAARSQIS
QIPGLSVLEI PPTPLFKEGF VDLDQTRLTI NISELGLTGF EAEEILNEMG VTSEFSSLQN
ITFIISLGNI WTDIDALVQG LKNLTRIPQL TSQYKLCKYT NDAMISLNMC ISPREAFFAN
SEILPLEKTE ERICAEIICP YPPGIPVLMP GEIITKSALE YLLQIQSLGG FITGCMDTSL
RSVKVIKT