Gene Ava_2161 details


Gene Information

Locus tag: Ava_2161
Symbol:
ID: 3680108
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 2673212
End bp: 2674282
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 46%
IMG OID: 637717504
Product: peptidase U62, modulator of DNA gyrase
Protein accession: YP_322676
Protein GI: 75908380
COG category: [R] General function prediction only
COG ID: [COG0312] Predicted Zn-dependent proteases and their inactivated homologs
TIGRFAM ID:
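
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database tool.

```python
# Values copied from the record above.
start_bp = 2673212
end_bp = 2674282


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


print(gene_length(start_bp, end_bp))  # 1071
print(protein_length(1071))           # 356
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.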


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.519115
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAGAAC GCGCTCTTGC TTTGAGCGAA CTGAACCAAT CAGAACCAGT GGAATTGGTC 
AGCAACTCTA AGCCATCCTA CCCAGACTTG GGTGAGGCTG TATCTGTAGA AGTTTTAGTT
GGTTGGGGCA AAGAAGCGAT CGCCATCATC CGCGATAATT ATCCCGATGT CCTCTGTAAT
AGTGACTGGG AATGTGATGT GGAAACGACT AGACTTGTCA ACACTCAAGG TTTAGATTGC
TACTACAGCG ATACTACCCT CAGTTGCTAT ATGTCCGCCG AATGGGTTCG TGGTGATGAT
TTTTTAAGTG TATCTGATGG ACAAACTCAG CGTGATTACT TAGACCCGGA AAAGTTGGCT
TATCAAATTT TACAAAGACT GGTTTGGGCT AAAGAAAACG TCCCACCTCC TAACGGTCGT
GTCCCGGTTT TATTTACCTC CAAGGCGGCG GATATGCTTT GGGGTACGGC GCAAGCAGCG
TTGAATGGCA AACGTGTACT AGAAGCAGCT TCCCCTTGGG CAGAACGCGT GGGTAAACAA
GTAATCGCTC CTAGCCTTAC CCTTTACCAA GACCCCCAAG CCGGGCCTTA TAGCTGCCCC
TTTGATGATG AAGGTACTCC GACCAAATCT TTGGTATTTA TCGAAAAAGG CATATTACAA
AATTATTATT GCGATCGCAC CACCGGAAGG CAACTAGGTA ATAGCACCAC CGGCAATGGT
TTTCGCCCTG GTTTAGGCAG TTATCCCACC CCTGGCTTAT TTAACTTTTT GATTAAGCCT
GGTTCTAAAT CCCTCAAAGA CCTGATCCAA AACATGGATG ATGGCTTGAT TGTAGACCAA
ATGCTCGGTG GTAGTGGTGG TATCTCTGGC GACTTTTCGA TCAATATCGA ATTAGGCTAT
CGAGTCCAAA AAGGTCAAGT AATCGGTCGC GTCAAAGATA CAATGGTCGC AGGCAATGTT
TATACCGCCC TCAAGCAAGT GGAATTAGGC AGTGATGCTG ATTGGAACGG TTCTTGTTAT
ACTCCGTCTT TAATAGTCGA AGGGCTATCG ACGACTGGGA GGAATAATTA G
 
Protein sequence
MVERALALSE LNQSEPVELV SNSKPSYPDL GEAVSVEVLV GWGKEAIAII RDNYPDVLCN 
SDWECDVETT RLVNTQGLDC YYSDTTLSCY MSAEWVRGDD FLSVSDGQTQ RDYLDPEKLA
YQILQRLVWA KENVPPPNGR VPVLFTSKAA DMLWGTAQAA LNGKRVLEAA SPWAERVGKQ
VIAPSLTLYQ DPQAGPYSCP FDDEGTPTKS LVFIEKGILQ NYYCDRTTGR QLGNSTTGNG
FRPGLGSYPT PGLFNFLIKP GSKSLKDLIQ NMDDGLIVDQ MLGGSGGISG DFSINIELGY
RVQKGQVIGR VKDTMVAGNV YTALKQVELG SDADWNGSCY TPSLIVEGLS TTGRNN
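
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch, not the database's own pipeline: it assumes the listed sequence is the coding strand read in frame from its first base, and it builds the codon table by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; since this gene begins with ATG, no special start-codon handling is included. The names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence in frame, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGGTAGAAC GCGCTCTTGC TTTGAGCGAA"
print(translate(first_blocks))  # MVERALALSE
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` should, under the same assumptions, give the complete protein.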