Gene Ava_0438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0438 
Symbol 
ID3682599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp558378 
End bp559349 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content46% 
IMG OID637715767 
Productpeptidase S51, dipeptidase E 
Protein accessionYP_320959 
Protein GI75906663 
COG category[P] Inorganic ion transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4242] Cyanophycinase and related exopeptidases 
TIGRFAM ID[TIGR02069] cyanophycinase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.148568 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.128845 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAGGG CATTCCCAAA TATAGCAAAG TGCTGGAAGA ATTTTATATC CCGTCTTACA 
GCATACCTGA AACTCTTGTT TCAGGGCAAC ACTGTCGAGG TTCTTCCTGC CCTAGCTGGC
CCTGTCCTTA ACTTAGGTGG GGGCGGCCCG GATGTGGAAG CGGCTATTCA GTGGATGATT
AACCAAGTCA GGGGAACCGC TAAGGTTAAC GTTGTCGTTC TCCGCACTTA CGGCGGGAAT
GACTACAATC ACCTGATTTA TCGAATGTCC GGTGTAAACT CCGTGCAGAC GTTGGTTATC
AGCAATCGCC AAGATGCTAA CAGAGATGAT ATTGTCCAGA AAGTCCTCAA TGCTGGTGTA
GTTTTCTTCG CAGGTGGCGA CCAATGTCAA TATATTCGTA GCTGGAAAAA CACCAAGTTA
GAAAAAGCTG TAGCGTCAGT TTACCGCAGA GGCGGAGCCG TTGGTGGTAC TAGCGCAGGG
GCAATGATTT TGAGTGATTT CATCTACGAC GCTTGTGCTT GCGAAGACCC CATCGAAACC
AAAGACGCAC TTGAAGACCC TTACCAAAAC ATTACCTTTA CCTACAACTT CTTCCAATGG
TCACATTTAC AAGGAACCAT CATTGACACC CACTTTGATA GTCGTAAACG CATGGGAAGA
ATCATGGCTT TTATTGCCAG ACAAATTCAA GATGGTGTGT CTAGAAGTGC CTTAGGGATA
GCCATTAGTG AGGAGACATC AGTTGTTGTA GATAAATATG GTAAGGCAAA AGTCTTGGGG
AGAAACGCCG CTTATTTCGT CTTAGGCGAC CATCCCCCAG AAATCTGCAA ACCCCGGACA
CCACTCACAT ATTCTGACTA CAAAATTTGG CGTATCCCCT GTGGCGACAC TTTTGATTTA
AATAATCCAC CAGCCAGGGG TTATTACTTC AGGAGTGTGA AGCGCGGGAG GTTCAATTCA
GACCCTTATT AG
 
Protein sequence
MTRAFPNIAK CWKNFISRLT AYLKLLFQGN TVEVLPALAG PVLNLGGGGP DVEAAIQWMI 
NQVRGTAKVN VVVLRTYGGN DYNHLIYRMS GVNSVQTLVI SNRQDANRDD IVQKVLNAGV
VFFAGGDQCQ YIRSWKNTKL EKAVASVYRR GGAVGGTSAG AMILSDFIYD ACACEDPIET
KDALEDPYQN ITFTYNFFQW SHLQGTIIDT HFDSRKRMGR IMAFIARQIQ DGVSRSALGI
AISEETSVVV DKYGKAKVLG RNAAYFVLGD HPPEICKPRT PLTYSDYKIW RIPCGDTFDL
NNPPARGYYF RSVKRGRFNS DPY