Gene Aazo_1846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1846 
Symbol 
ID9339639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1917333 
End bp1919342 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content41% 
IMG OID 
Productarginine decarboxylase 
Protein accessionYP_003721071 
Protein GI298490894 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGTCG AGTCAAGTGC TGATGAGATT GTCAAAGTGC CTCCTAATGG GCATAAATCC 
GAATTGAAAA GCCAAAAACA CAAGAAACTG CTGCCACCTA CTACGACAGG AGATTTGCCT
CGTGCTTGGA AAATTGAGGA CAGCGAAGAC CTTTACCGAA TTGAAGGTTG GGGAAAGCCT
TATTTTTCCA TTAATGCGGC TGGGAACGTA ACAGTTTCTC CCAAGGGTGA TCGCGGTGGT
TCCTTAGATT TGTTTGAATT AGTCAATGCT TTAAAGCAAC GTAATTTGGG TCTTCCTCTG
TTGATTCGCT TTTCTGATAT TTTAGAAGAC CGAATTGAAC GGTTGAATGC TTGTTTTGCT
AAAGCTATAG CACGTTATAA CTATCCTGGC GTTTATCGTG GTGTTTTTCC AGTTAAATGC
AACCAGGAAA GACACTTGAT AGAAGATTTG GTGCGTTTTG GCAAACCTCA TCAATTTGGA
CTAGAAGCGG GATCTAAGCC AGAATTAATG ATTGCGCTCG CTTTACTGAA TACACCAGGT
GCATTGTTAG TTTGCAATGG TTACAAAGAC CGAGAATACA TTGAAACAGC AATGTTATCT
CAAAGGTTAG GACAAACAGC AATCATTGTT ATAGAACAGA TTGAAGAAGT AGATTTGGTG
ATTGCAGCTA ACCTTCAATT AGGAATTAAA CCCATTTTGG GGGTAAGAGC GAAATTAAGT
ACCCAAGGAA TGGGACGTTG GGGAACTTCC ACAGGTGATC GCGCTAAATT TGGGTTGACA
ATTCCCGAAA TTATGGAAGC GGTTGATAAG TTAAGAGAAG CTAATTTGCT CGGTTGCTTG
CAATTGTTAC ACTTCCATAT TGGCTCACAA ATCTCCGCCA TCAATGTGAT TAAAGATGCC
ATCCAAGAAG CCAGTCGTAT TTATGTAGAA TTGGCGATGT TAGGCGCAGA TATGAAATAT
CTGGATGTTG GTGGTGGCTT GGGTGTAGAT TACGACGGTT CTCAAACTAA TTTTTACGCC
TCTAAAAACT ACAATATGCA AAACTATGCC AATGATATTG TGGCAGAGTT AAAAGATACC
TGTGCAGAAC GTCAAATTAC CGTACCTATA CTCATTAGTG AAAGTGGTAG AGCGATCGCA
TCCCATCAAT CAGTCCTCAT TTTTGACGTT CTCAGCACCA GTGATGTCCC CCTCGAACTC
CCAGATCAAC CACAAGAGGG AGAATCACCA ATCATTAATT ACCTGTGGGA AACCTACCAA
TCTATTAACA AAGAGAACTA TCAGGAGTTC TACCACGACG CGGCTCAATT TAAAGAAGAA
GCCATAAGCC GCTTTAACTT AGGAATTTTA CGACTTAGAG AACGAGCCAA AGCCGAGCGA
CTGTACTGGG CTTGTTGCGG TAAGATTCTA GATATTACAC GACAACAAGA CTACGTACCT
GATGAACTGG AAGACCTAGA AAAAATCATG GCTTCCATCT ATTACATTAA TCTTTCCGTG
TTTCAATCAG CACCAGATTG TTGGGCAATT GATCAACTAT TTCCCATTAT GCCCATACAT
AAGCTAGATC AAGAACCCAC ACAACGAGGA ATTTTGGCAG ACCTCACCTG TGATAGCGAT
GGTAAAATCG ACCGATTTAT CGATCTGCGG GATGTCAAAT CAGTGTTAGA ACTGCATAAA
TTCAAACCAG ATCAACCCTA TTATCTAGGA ATGTTCCTTA ATGGAGCTTA CCAGGAAATC
ATGGGTAATT TACACAACCT ATTTGGTGAC ACCAATGCTG TTCACATCCA ATTAACACCT
AAAGGCTACC AAATTGAACA CGTTGTTAAG GGTGATACCA TGAGTGAAGT AGTTAGCTAC
ATGCAGTATG ACTCCGAGGA TATGGTAGAA AATATCCGCC AGCGTTGTGA AAAAGCCTTA
GAAGAAAATC GCATTACCCT AGCTGAATCT CAACGACTAT TACAAACCTA CGAGCAAAGT
CTCAGGAGAT ATACGTATTT GAATAGTTAG
 
Protein sequence
MGVESSADEI VKVPPNGHKS ELKSQKHKKL LPPTTTGDLP RAWKIEDSED LYRIEGWGKP 
YFSINAAGNV TVSPKGDRGG SLDLFELVNA LKQRNLGLPL LIRFSDILED RIERLNACFA
KAIARYNYPG VYRGVFPVKC NQERHLIEDL VRFGKPHQFG LEAGSKPELM IALALLNTPG
ALLVCNGYKD REYIETAMLS QRLGQTAIIV IEQIEEVDLV IAANLQLGIK PILGVRAKLS
TQGMGRWGTS TGDRAKFGLT IPEIMEAVDK LREANLLGCL QLLHFHIGSQ ISAINVIKDA
IQEASRIYVE LAMLGADMKY LDVGGGLGVD YDGSQTNFYA SKNYNMQNYA NDIVAELKDT
CAERQITVPI LISESGRAIA SHQSVLIFDV LSTSDVPLEL PDQPQEGESP IINYLWETYQ
SINKENYQEF YHDAAQFKEE AISRFNLGIL RLRERAKAER LYWACCGKIL DITRQQDYVP
DELEDLEKIM ASIYYINLSV FQSAPDCWAI DQLFPIMPIH KLDQEPTQRG ILADLTCDSD
GKIDRFIDLR DVKSVLELHK FKPDQPYYLG MFLNGAYQEI MGNLHNLFGD TNAVHIQLTP
KGYQIEHVVK GDTMSEVVSY MQYDSEDMVE NIRQRCEKAL EENRITLAES QRLLQTYEQS
LRRYTYLNS