Gene Ava_2444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2444 
Symbol 
ID3683087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3034506 
End bp3036407 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content44% 
IMG OID637717787 
ProductFtsH peptidase 
Protein accessionYP_322954 
Protein GI75908658 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000343724 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATT TTGGGAAAAA GGCATTGATA AAACAGCAAT CACCAAAGCG CGTTGCTTGG 
ACTGGTGCTT TGGCAGCCAG TTTGATTATG TTACCAACGA TGTTTGGCGG TAATCCTGTC
TTAGCGCAAA AAGCAGAGCG TGAGTCTCTG TCATACGGAG AGTTGATTCA AAAAGTTAAT
CAAGAGCAAG TCAAAAGAGT AGAACTGGAC GAAACTGAAC AGATAGCTAA AGTTTATTTA
AAAGGGCAAA AACCAGACGC ACCACCAATA CAGGTGAGGT TGTTGGAGCA GAACAACGAG
TTAATTAACA GACTCAAAGA AAAAAATGTT GATTTTGGTG AGATTTCTTC TGCCAATAGT
AGAGCTGCTG TAGGGTTATT AATTAACCTG ATGTGGATTT TGCCATTGGT GGCTTTAATG
CTGCTATTTC TGCGTCGTTC TACAAATGCT TCTAGCCAAG CAATGAATTT TGGCAAATCT
AGGGCGCGTT TCCAAATGGA AGCCAAGACT GGGGTGAAGT TTGACGATGT AGCGGGTATT
GAAGAAGCGA AGGAAGAATT ACAAGAAGTT GTGACATTCC TCAAGCAGCC AGAAAGATTT
ACGGCTGTGG GTGCGCGGAT ACCTAAAGGT GTGCTGTTGG TGGGGCCTCC AGGTACTGGT
AAAACTTTAC TAGCAAAAGC GATCGCTGGG GAAGCGGCTG TACCATTTTT CAGCATTTCC
GGTTCGGAAT TTGTGGAAAT GTTCGTGGGT GTGGGTGCTT CTCGCGTCCG CGATTTGTTT
AAGAAAGCTA AAGACAATGC GCCTTGTCTG ATATTTATCG ATGAAATCGA TGCAGTTGGC
AGACAACGGG GTACGGGTAT TGGTGGGGGT AACGATGAGA GAGAACAAAC CCTCAATCAG
TTACTCACGG AGATGGATGG TTTTGAAGGT AACACAGGCA TCATTATTAT TGCTGCAACC
AACCGTCCCG ACGTATTAGA TTCAGCTTTG TTACGTCCTG GTCGTTTCGA CAGACAAGTA
ATTGTTGATG CACCAGACTT GAAAGGACGC TTAGAGATTT TGCAAGTCCA TTCACGCAAT
AAGAAAGTTG ACCCCAGTGT ATCACTAGAG GCGATCGCTC GTCGCACACC CGGATTTACA
GGTGCAGATT TAGCCAACTT ACTCAACGAA GCCGCTATCC TCACAGCACG TAGACGCAAA
GAAGCAATTA CGATTCTAGA AATTGATGAC GCTGTTGATA GGGTAGTTGC TGGGATGGAA
GGGACACCCC TAGTAGACAG CAAGAGTAAG CGCTTAATTG CTTACCATGA AGTTGGACAT
GGTTTAGTCG GGACGTTATT AAAAGACCAT GACCCAGTGC AGAAAGTCAC CCTGATTCCC
AGAGGACAAG CACAAGGTTT AACTTGGTTT ACTCCCAACG AAGAACAAGG GTTAATCTCT
CGTTCCCAAC TCAAAGCTAG AATTACTTCT ACTTTGGCCG GTCGTGCTGC TGAAGAAATT
GTCTTTGGTA AGCCAGAAGT GACCACAGGT GCGGGTGATG ACCTGCAAAA AGTCACATCA
ATGGCAAGGC AAATGGTGAC AAGGTTTGGT ATGTCTGAAC TAGGCCCCTT ATCTCTGGAA
AATCAAAGTG GCGAGGTATT TTTAGGACGC GACTGGATGA ATAAATCCGA CTATTCTGAA
GAAATAGCTG CCAAGATAGA TTCTCAAGTC CGAGAAATTA TCAACACCTG TTACCAAACA
TCAAAGGAAC TTTTGCAAAC TAACCGCGTG GTTATGGAAC GACTAGTAGA TTTGTTGACA
GAACAAGAAA CTATTGAAGG TGATTTGTTC CGTAAAATTG TTAGCGAAAG TCAAAACCAA
GTGGTTGATG AGCAATTGTC GATGGTAATG GGTAATGGGT AA
 
Protein sequence
MKNFGKKALI KQQSPKRVAW TGALAASLIM LPTMFGGNPV LAQKAERESL SYGELIQKVN 
QEQVKRVELD ETEQIAKVYL KGQKPDAPPI QVRLLEQNNE LINRLKEKNV DFGEISSANS
RAAVGLLINL MWILPLVALM LLFLRRSTNA SSQAMNFGKS RARFQMEAKT GVKFDDVAGI
EEAKEELQEV VTFLKQPERF TAVGARIPKG VLLVGPPGTG KTLLAKAIAG EAAVPFFSIS
GSEFVEMFVG VGASRVRDLF KKAKDNAPCL IFIDEIDAVG RQRGTGIGGG NDEREQTLNQ
LLTEMDGFEG NTGIIIIAAT NRPDVLDSAL LRPGRFDRQV IVDAPDLKGR LEILQVHSRN
KKVDPSVSLE AIARRTPGFT GADLANLLNE AAILTARRRK EAITILEIDD AVDRVVAGME
GTPLVDSKSK RLIAYHEVGH GLVGTLLKDH DPVQKVTLIP RGQAQGLTWF TPNEEQGLIS
RSQLKARITS TLAGRAAEEI VFGKPEVTTG AGDDLQKVTS MARQMVTRFG MSELGPLSLE
NQSGEVFLGR DWMNKSDYSE EIAAKIDSQV REIINTCYQT SKELLQTNRV VMERLVDLLT
EQETIEGDLF RKIVSESQNQ VVDEQLSMVM GNG