Gene Ava_3988 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3988 
Symbol 
ID3679662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4958808 
End bp4960511 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content44% 
IMG OID637719340 
Producthistidine ammonia-lyase 
Protein accessionYP_324488 
Protein GI75910192 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase
[TIGR01226] phenylalanine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.488939 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACAC TATCTCAAGC ACAAAGCAAA ACCTCATCTC AACAATTTTC TTTTACTGGA 
AATTCTTCTG CCAATGTAAT TATTGGTAAT CAGAAACTCA CAATCAATGA TGTTGCAAGG
GTAGCGCGTA ATGGCACCTT AGTGTCTTTA ACCAATAACA CTGATATTTT GCAGGGTATT
CAGGCATCTT GTGATTACAT TAATAATGCT GTTGAATCTG GGGAACCAAT TTATGGAGTG
ACATCTGGTT TTGGCGGTAT GGCCAATGTT GCCATATCCC GTGAACAAGC ATCTGAACTC
CAAACCAACT TAGTTTGGTT CCTGAAAACA GGTGCAGGGA ACAAATTACC CTTGGCGGAT
GTGCGCGCAG CTATGCTCTT GCGTGCAAAC TCTCATATGC GCGGTGCATC TGGCATCAGA
TTAGAACTTA TCAAGCGTAT GGAGATTTTC CTTAACGCTG GTGTCACACC ATATGTGTAT
GAGTTTGGTT CAATTGGTGC AAGTGGTGAT TTAGTGCCAC TATCCTACAT TACTGGTTCA
CTGATAGGCT TAGATCCCAG TTTTAAGGTT GACTTCAACG GTAAAGAAAT GGATGCGCCA
ACAGCTCTAC GTCAACTGAA TTTGTCACCC TTGACATTGT TGCCGAAGGA AGGCTTGGCG
ATGATGAACG GCACTTCAGT CATGACAGGT ATTGCAGCAA ACTGCGTCTA CGATACTCAA
ATTTTAACTG CGATCGCTAT GGGCGTTCAC GCTCTAGATA TCCAAGCTTT AAACGGAACC
AATCAATCAT TCCATCCATT TATCCATAAT TCCAAACCAC ATCCTGGTCA ATTATGGGCA
GCAGATCAGA TGATTTCTTT GTTAGCCAAT TCCCAGTTAG TTCGTGATGA GTTAGATGGT
AAACACGATT ATCGTGATCA CGAGTTGATT CAAGATCGTT ACTCACTCCG ATGCCTTCCC
CAGTATTTGG GGCCAATCGT TGATGGAATT TCCCAGATTG CCAAACAAAT TGAAATCGAA
ATCAACTCAG TCACCGATAA CCCACTAATT GATGTTGATA ACCAAGCTAG CTATCATGGA
GGAAATTTCC TCGGACAGTA CGTGGGTATG GGAATGGATC ACCTGCGTTA CTATATTGGG
TTATTGGCTA AACACCTAGA TGTGCAGATT GCCCTCCTCG CCTCACCAGA GTTTAGCAAT
GGACTACCAC CATCTTTATT AGGCAACCGA GAACGTAAAG TCAATATGGG ACTCAAAGGT
CTGCAAATAT GCGGTAACTC AATTATGCCA CTGTTGACCT TCTATGGAAA TTCCATCGCC
GATCGCTTTC CTACCCATGC AGAACAATTT AATCAGAACA TCAACAGTCA AGGATACACT
TCAGCGACTC TAGCCCGCCG TTCTGTGGAT ATCTTCCAGA ATTATGTGGC GATCGCTCTG
ATGTTTGGAG TCCAAGCTGT TGACCTCCGC ACATATAAAA AGACTGGTCA TTACGATGCA
CGCGCCTGTC TATCACCTGC AACTGAGCGC TTATATTCAG CAGTCCGCCA CGTAGTTGGA
CAAAAACCAA CTTCAGATCG CCCATATATT TGGAATGATA ATGAGCAAGG ACTGGATGAG
CATATTGCCC GGATTTCTGC TGATATCGCT GCTGGTGGTG TGATTGTGCA AGCAGTTCAA
GATATCTTAC CCTGCTTGCA TTAA
 
Protein sequence
MKTLSQAQSK TSSQQFSFTG NSSANVIIGN QKLTINDVAR VARNGTLVSL TNNTDILQGI 
QASCDYINNA VESGEPIYGV TSGFGGMANV AISREQASEL QTNLVWFLKT GAGNKLPLAD
VRAAMLLRAN SHMRGASGIR LELIKRMEIF LNAGVTPYVY EFGSIGASGD LVPLSYITGS
LIGLDPSFKV DFNGKEMDAP TALRQLNLSP LTLLPKEGLA MMNGTSVMTG IAANCVYDTQ
ILTAIAMGVH ALDIQALNGT NQSFHPFIHN SKPHPGQLWA ADQMISLLAN SQLVRDELDG
KHDYRDHELI QDRYSLRCLP QYLGPIVDGI SQIAKQIEIE INSVTDNPLI DVDNQASYHG
GNFLGQYVGM GMDHLRYYIG LLAKHLDVQI ALLASPEFSN GLPPSLLGNR ERKVNMGLKG
LQICGNSIMP LLTFYGNSIA DRFPTHAEQF NQNINSQGYT SATLARRSVD IFQNYVAIAL
MFGVQAVDLR TYKKTGHYDA RACLSPATER LYSAVRHVVG QKPTSDRPYI WNDNEQGLDE
HIARISADIA AGGVIVQAVQ DILPCLH