Gene Aazo_1169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1169 
Symbol 
ID9338964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1251986 
End bp1253374 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content45% 
IMG OID 
Productargininosuccinate lyase 
Protein accessionYP_003720614 
Protein GI298490437 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.807635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAAA AACAAACTTG GAGCCAGCGG TTTGAATCTA CACTACATCC AGCGATTGCA 
CGTTTTAATG CTAGTATAAG CTTTGACATT GAACTCATAG AGTATGATAT TACTGGTTCT
CAAGCTCATG CTCAAATGTT GGCTCATACA GGGATTATTT CCCCAGAAGA AGCAAAGCAA
TTAGTTACAG GTTTAGATAA AATTCGGCAA GAATACCGAC AGGGCAAATT TCAGCCTGGT
ATTGATGCAG AAGATGTGCA TTTTGCTGTT GAAAAGCGAC TTACAGAAAT TGCTGGTGAT
GTCGGTAAAA AGCTACATAC CGCCCGTTCT CGTAATGACC AAGTAGGCAC TGATACTAGA
CTCTACCTCC GTGACCAAAT CCAACAAATC CGCCAGCATT TACGGGAATT TCAAGCCGTC
TTACTCGACA TAGCTGAAAA AAATATTGAA ACCCTGATTC CTGGTTATAC TCACCTCCAA
CGCGCCCAAC CCGTCAGTTT AGCGCACCAC CTCCTGGCAT ACTTTCAGAT GGCACAACGA
GACCTGGAAC GCTTAGGAGA TGTTTATCAA CGAGTGAATA TTTCACCCTT GGGTTGTGGT
GCTTTAGCGG GAACAACTTT TCCCATTGAC CGACATTACA CCGCCAAATT ATTGCATTTT
GATCAAATTT ATGCTAACAG CTTGGATGGG GTGAGCGATC GCGACTTTGC CATAGAATTT
CTGTGTGCAG CCAGTATTAT CATGGTTCAC CTCAGCCGTC TTTCAGAAGA AGTCATTCTC
TGGTCATCAG AAGAATTTCG CTTTGTCACC CTCAAAGATA GCTGTGCCAC AGGTTCCAGC
ATCATGCCCC AAAAGAAAAA TCCCGACGTG CCAGAACTAG TACGAGGCAA AACCGGGCGT
GTCTTTGGTC ATCTTCAGGC GATGTTAGTC ATCATTAAGG GACTACCCCT GGCATATAAC
AAAGACCTGC AAGAAGACAA AGAAGGCATA TTTGACAGTG TGAATACCAT CAAAGCCTGT
CTAGAAGCCA TAACCATTTT GTTGAGAGAA GGTTTAGAAT TTAGTCCCGA GCGATTAGCA
CAAGCAGTTA CAGAAGACTT TTCTAACGCT ACTGATGTCG CAGACTATCT AGCCGCACGG
GGAGTACCAT TCCGCGAAGC TTACAACCTT GTAGGTAAGG TAGTAAAAAC GAGCATAGGC
GCAGGTAAAC TACTCAAAGA CTTAACCCTA GAAGAATGGC AACAAATACA CCCAGCATTT
GCAGCCGATA TTTATGAAGC CATATCCCCC TATCAAGTAG TTGCAGCCCG CAACAGTTAC
GGTGGTACTG GTTTTGCACA AGTTCGCCAA GCTCTTCTTG CAGCCCGTAC TCAAATCAGT
GCTGAATGA
 
Protein sequence
MTEKQTWSQR FESTLHPAIA RFNASISFDI ELIEYDITGS QAHAQMLAHT GIISPEEAKQ 
LVTGLDKIRQ EYRQGKFQPG IDAEDVHFAV EKRLTEIAGD VGKKLHTARS RNDQVGTDTR
LYLRDQIQQI RQHLREFQAV LLDIAEKNIE TLIPGYTHLQ RAQPVSLAHH LLAYFQMAQR
DLERLGDVYQ RVNISPLGCG ALAGTTFPID RHYTAKLLHF DQIYANSLDG VSDRDFAIEF
LCAASIIMVH LSRLSEEVIL WSSEEFRFVT LKDSCATGSS IMPQKKNPDV PELVRGKTGR
VFGHLQAMLV IIKGLPLAYN KDLQEDKEGI FDSVNTIKAC LEAITILLRE GLEFSPERLA
QAVTEDFSNA TDVADYLAAR GVPFREAYNL VGKVVKTSIG AGKLLKDLTL EEWQQIHPAF
AADIYEAISP YQVVAARNSY GGTGFAQVRQ ALLAARTQIS AE