Gene Ava_4110 details

Gene Information

Locus tag: Ava_4110
Symbol:
ID: 3681498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5123111
End bp: 5125300
Gene Length: 2190 bp
Protein Length: 729 aa
Translation table: 11
GC content: 35%
IMG OID: 637719457
Product: peptidase C14, caspase catalytic subunit p20
Protein accession: YP_324605
Protein GI: 75910309
COG category: [R] General function prediction only
COG ID: [COG4249] Uncharacterized protein containing caspase domain
TIGRFAM ID:
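
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 5123111
END_BP = 5125300
GENE_LENGTH_BP = 2190
PROTEIN_LENGTH_AA = 729


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 5125300 - 5123111 + 1 = 2190 bp, and 2190 / 3 - 1 = 729 aa.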


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.110256
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCCCC TGGGTATTGG TAGTAGTGAT TCCACTTATA TATTTACAAC AGGAGAAGCA 
AAACTCTGGA TTTTGTTAGT AGGTATTAAC GAATACCAAG ATGTGAGTTT ACCTAATTTA
CGCTATCCGG CAGGTGACTG TGAAGCTTTA GGAGACGCAT TAGCAAAAGT TACACAGAGA
TTTTTACGTA AGGAAGTGAT CATTCATCAT GATTTTATGG AAGACACTCC TACCTTAAAA
ACGGTCTGTC GGAGTCTGGA ACGAATTGTT TCAAAAGCTA AACCTACTGA CTCAATATTT
TTATATTTTT CTGGTCATGG AATGTTAGAG CCAGATACTC AAGATGTTGT GCTGTGCTTA
TCAGATACCA GACAAGAAAA CCTGCCAGAT ACAGGATTAC CTGTGCAAAA ACTGTTACAA
ATCTTAGAGA CTAGTCGTAC CAATCAACAA CTAGTTTGTC TAGATACCTG TCACAGTGGC
GATATGAAAA TGCCTCAAAT AAACAACGCT AGCTTCAGAG AATTAAACAT CACTGAGACA
TTACTGAACC CCGCAACAAA TCTTGTGAAT GCTTTGCGAA AACGCGCTAG TCGTAGTAAA
GGATTTTGTG CTTTGTTATC TTGCGATCAA GGACAGCAGT CTTGGGAGTT TCCGGAACTG
GGACATGGAG CCTTTACTTA TTACTTAGTA CGGGGCCTAT TGGGTGAAGC TGCTGATTCT
CATGGCGTTA TTGAAGCAGA TGGTTTATAT AAATATGTTT ATCGCCAGAC ATTACAATAT
ATTGATAAAT TAAATCACCA ACTGCGTCTG ATAAATAAAC AAAAACTCAA CCGTGGGGAC
AGAAAATTAT ATCCAGAATA TCCTTTGCAA ACGCCTAAAA GAATTGTTGA AGGAGTAGGA
GAATTTATTT TAGGATTCAA ATATGATGCA GATGAATCTC ACCAACAGCG ACGAGCTTTA
GTTATAGATG GACTCTCAAA CAAGATAAAT ACTGATTTGA TTGATATATT TGCTCATGCT
GGTGATTTTC AAGCCGAGTG TTGGCATCAA CAAAGCAAAT CTTGGTCAGA TATAGAGATA
AGAATCAAAG GGTTTCTAGA TAGAGATAGT AAATCAACCA TAAAATCTCC TCCTTATTTA
GAATTAATCA AACCTACACC AACTTGCTTA CTTTATTTAC GTGGGTACAT TGAAGAAAAC
GAAAATGGAG AAGCTTGGTT CATATTAGGC AATGGAGTAC TTCTTAGCCG TTCTTACTTA
AAACAACAGC TACAACGTGC AACTAAGACT CAACAAATAA TCATTTTAGA TTGTCCTAAC
ACTAATTCCC TCAAAAAATG GATAGAATAT TGCCAATGTG GTACAGAGTA TGGGCAATGT
ATAATTGCAG CTACTTCCAC AATAGATAAA TCAGAATTAT TTTGCCAAAC ACTCCACAAC
ATCCTTGCTA GTGTAAATGT ACAAGTTGGT TTGTCAGTTT CTAAATTGAT TGCTGAATTA
CAAAAACGTT TACCAAAACA AGATGTCAGA CTTGATTTTT GGATATCAGA AACGCAAACG
ATAATTGATA TTTTACCTAG TAAGAATTAT CCAAATTTTT CCCAAAGACC TTATCTCCAG
AGAAATCAAC AGGAGCAACA AAAAAAATTC CCATCTCACC ATCTTGATGA TGTGACAGAA
ATTCCCTCTA CTACCTTACC TTTACCATCA CCTATTCCTG TTACTAGTGA AGCTATAGAG
ACAGTAAAAC CTCAACTTAA TTTGCTCCTT AGTTCAGAAC AGCTTACAGA ATTAGAAAAT
TTACTGAAAC AATCATTAGG CATAGTTGCG CCTATAGTTT TAAAAAAAGC TTTAAAAGTA
AATAATGGTA CAGAATTAAT AAGAACATTA GCTAATTATT TACCACATAA GGAACAAGAA
AAATTTAAAG AACAAGCATT ATTTATATTA AATAAAAAAA GTAGTTTTTC TCTCAGTAGA
TCGGCAAATG AACAAACAAT AAATGCAGCT TTTCTTGGTA AATGTGAACG TGAGTTAACC
AATTTAATTG GCTCAGATGC TAGATTGAAT ATTCAACATA TTTTAGAATC TCATACCCAA
ATCACTTCCA AAGAATTGGT AGATAAATTG ATAGCCAAAA TTCCCGATCC ACAGCTAGCT
TTAAAGTTTA AACAGCGTAT ATGGGGTTGA
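
The GC content field reported above is a fraction of G and C bases over the sequence length. A minimal sketch of that calculation follows, assuming the sequence is pasted as shown here (ten-base blocks separated by spaces and line breaks); `gc_content` is a hypothetical helper name, and how the record rounds its percentage is not stated.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First ten-base block of the gene sequence: 1 G and 5 C out of 10 bases.
print(gc_content("ATGTCTCCCC"))
```

Passing the full gene sequence to the same function gives the value to compare against the GC content field.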
 
Protein sequence
MSPLGIGSSD STYIFTTGEA KLWILLVGIN EYQDVSLPNL RYPAGDCEAL GDALAKVTQR 
FLRKEVIIHH DFMEDTPTLK TVCRSLERIV SKAKPTDSIF LYFSGHGMLE PDTQDVVLCL
SDTRQENLPD TGLPVQKLLQ ILETSRTNQQ LVCLDTCHSG DMKMPQINNA SFRELNITET
LLNPATNLVN ALRKRASRSK GFCALLSCDQ GQQSWEFPEL GHGAFTYYLV RGLLGEAADS
HGVIEADGLY KYVYRQTLQY IDKLNHQLRL INKQKLNRGD RKLYPEYPLQ TPKRIVEGVG
EFILGFKYDA DESHQQRRAL VIDGLSNKIN TDLIDIFAHA GDFQAECWHQ QSKSWSDIEI
RIKGFLDRDS KSTIKSPPYL ELIKPTPTCL LYLRGYIEEN ENGEAWFILG NGVLLSRSYL
KQQLQRATKT QQIIILDCPN TNSLKKWIEY CQCGTEYGQC IIAATSTIDK SELFCQTLHN
ILASVNVQVG LSVSKLIAEL QKRLPKQDVR LDFWISETQT IIDILPSKNY PNFSQRPYLQ
RNQQEQQKKF PSHHLDDVTE IPSTTLPLPS PIPVTSEAIE TVKPQLNLLL SSEQLTELEN
LLKQSLGIVA PIVLKKALKV NNGTELIRTL ANYLPHKEQE KFKEQALFIL NKKSSFSLSR
SANEQTINAA FLGKCERELT NLIGSDARLN IQHILESHTQ ITSKELVDKL IAKIPDPQLA
LKFKQRIWG
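
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is a simplified illustration, not the method used to build the record: it assumes the sense-strand sequence as printed, uses the amino acid assignments that translation table 11 shares with the standard code, and does not handle the alternative start codons that table 11 allows (this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last line of the gene sequence above.
print(translate("ATGTCTCCCC TGGGTATTGG TAGTAGTGAT"))  # MSPLGIGSSD
print(translate("TTAAAGTTTA AACAGCGTAT ATGGGGTTGA"))  # LKFKQRIWG
```

The two excerpts match the first ten and last nine residues of the protein sequence listed above; the final TGA codon is the stop.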