Gene Ava_0537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0537 
Symbol 
ID3682367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp675190 
End bp678069 
Gene Length2880 bp 
Protein Length959 aa 
Translation table11 
GC content39% 
IMG OID637715865 
Productpentapeptide repeat-containing protein 
Protein accessionYP_321056 
Protein GI75906760 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.884465 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTCA GTCTCCGGCA ATGGTTGGCA GAACGCCAGA TAGAAATCAG CGATATAAAA 
ACATTTGCTT CTGGGCAACT GGCAGGTATT GCCTATCGTA TCGTTCAGGA TATGGAACTT
AAAAGCCTGA TGCCGTTGGA TATTTGTACG TTAGCAGAAG TGCTGCAATT ACCATTAGGC
ACAGTCGAGG AAGAAATTTC TGTGCTTGCT TCTTTAAGCG AACATCTTCT CCGTCACCTC
AGCCAAAAAA AAGCCCTCAA ACGTAATGAA GGTACTTGGT TAGCATTCCA AATTGCTTAC
CTATTGGCGT TAGAACAAGT TTTACTCCAA GAAGAACAAT TAAAAAGACC TTGGCTAAAT
CGTGCCAAAA TACCTGCACT AGCCACAATA ATTATCTCAG ATCCTCAACT CCAAGGATTA
TTAAAAACTC TCTCCCCTGG TAAATTAACT GATACTCAAG CCGAACAAGC GCTGATTTCC
GTGGCAGATT CATTACTAGT ACAACAAATG AATCATGCTA CTGTAGCCTG GTTAATGGCC
AATGGTGCGG AGGAGTTAGA AGCTAAATTA TTAACTCAGC GTTTAGATAA TTCTCTCCCC
GGATACCTGA TAAAAATCAT TGCTCAAAAC CCTGCACCTT TAGCTCAGCT ACAAAAATTT
TTCCGGATAG GAACTTTAGA AGATGTTTTA AATATTGACT TATATAAAGA GAAGTATCGC
GCTAGTTTGC TGCAAACTCT CAGTACACCA TTATTGATGG AATATTTTGC CCTGAAGAAT
ATTTATGTCC CTTTATCTGG TGTACCTCAA GAACCTAATG ATGGACAGTC TATTGATTTA
AAAATATGGG TAGAAAAACA ACTTATTGAT TTAGAAACTA TTGCTGTAAT TGAATCAGAA
CCTGGTTATG GTAAAACTAG TTTTTGTCAG ATTTGGGCGG CAGAAGTAGC ATTAAAGCTT
TATCCTCATT GGCTGCCCAT ACTGATTCGG TTGCGGGATA TTAAATATGG TAAAAGTTTA
CTGGAAACCC TCAATTCTGG TTTTACATTC AATGTTCATA TCAACTTATC CGCTTGGTTA
GAGCAAACAA ATAATCGGTG TGTTTTATTA CTAGATGGAT TAGATGAACT TCCCTCTTCT
CATCAGGGAA ATAGAGATAA ACAAATTTTT ATTCAACAGT TACTCCAATT TCAATCCCAA
GAACAACATA AAATTGTCTT GACTAGTTGC TCTCACACAG TAGAGGAAAT TACTTCAGAA
ATCCCCCTCC AATGGCGGAC TATTAAAATT CAACCTTTAG AAGTAAACCA GTTAAAACAA
TGGTTTCAAC AATGGGCATT ATTCCAGTCG TTACCCATTT CCCAAAACTT TTTTACATTC
TTAAAACAAG CAGGATTATT TGCTAATAAG TCTCACTTAT CTGAACTATC TCATTTAGTA
CGTCAACCAT TAATGTTGCA TTTATTGGGT GTTTTCCACC GCGACGGACT ATTAGATGAT
GAAATATTGC AAAAAAATGC CAAGTTTTCT CTCCATTGGG AAATTTACTC TCGCCTAAAA
CGATGGTTGC TGGGGTATCC GTTGACTGCG GGAATGAGGA CAATGCTATT ACGTCCGGGA
ACTGCTCATA TTCACCGTAC ACCAGAAGCA ATCACCAATT TACTAGGAGA ATACCATCCC
CAAGACCTGA TTGCACAAAT GCAGGCGATC GCTCTCAAAA TTTTACATGG CGATCGTCAT
CAAGTTACCT TAACTGGAGA ATTCAATACA AACACTCTCC CGGCTTTATA TTTCCGTTAT
TGTGTTAGCA GTCAGTTGCC ACAAGCTAAT GACAAAGCAC AACTAACAGT TAAAGTAGAG
TTTTCCCACT CACAAATCGG TGAATATCTC TGTGCTGAAG CTGTAGCAAC CCAACTGCAA
AGGCTAACCC AGTGCCAAGA AGATGTTTAC GGAACAGAAA CGTTTATTTT CAATACTCCT
AGCAGTGTTG CCCAGCATCT CTACAATTTA CTAGGCTACG GCATTATTAC ACCAGAAATT
GAGGGATTAG TCATCGCCGC ATTACAAACC CAGCAAAAAC CTATTTTATT ACGACGACTA
GAATCTTTTT GGCGTGGTTG GTGTCAAGGA CGTTGGTTAG ATGAAAGCAT TGCCCACACA
GCACTACCCT ATTTTCACAG CTTACAAAAT CTTGTGAATG TTGAGCAGGT GAATACAAAT
GTGGGAATAA ATGTATTTTT ATTGCTAGCT GCAATTTGTC GAGACATTCA AGTTGCTTTT
AGTCCTTGTG GTAATCCAGC AAATGTCAGT GAGTTTTACC CCCAAACAAT GATGATGTTG
TTAGCTAAAG CCTCTTTTTC TGGAAGTAAC ACTCTCATAA AGCGCATCTG TTCCCAATCT
TTAGCTAAAA TCAACCTTTC AGGGGCTTTT TTATCGTCAG TCGTCCTGAC TGGCGCAAAT
CTTGAACAAG CAAATTTATG CGATGCTGTA TTGGTGAATG CAAATTTAGC TGGTGCTAAC
TTAAATAACG CAAATTTAGC TGGTGCTAAC TTAGCCGGCG CAAATCTCAC CGATGTCAGC
TTGGAAACCG TCAATCTTAC CAATGCTTGT TTATGTGATG TTCCCCTCAC GGAGGCTGAA
AGAGAAATTG CCCAATTCCA CGGCGCATTA TTTTCTCTAG AACAATTTCA AGTAGTGAAA
AGTTTATTAT CCAAGCAATA TTACCTTAGT ATTTCTAGTA CCAAAGAGAA AACTAAGTTT
TGGAATCAAA ATAGTCTCGA TACCGGCTTA ATTGAAAGCT TGGAAGGTGA AGTAATTATG
CCTACAATTT TAGATCACGA GACCTATGAT GAGACGGTTT TTGGTAAAAG TGAGTTATAA
 
Protein sequence
MNLSLRQWLA ERQIEISDIK TFASGQLAGI AYRIVQDMEL KSLMPLDICT LAEVLQLPLG 
TVEEEISVLA SLSEHLLRHL SQKKALKRNE GTWLAFQIAY LLALEQVLLQ EEQLKRPWLN
RAKIPALATI IISDPQLQGL LKTLSPGKLT DTQAEQALIS VADSLLVQQM NHATVAWLMA
NGAEELEAKL LTQRLDNSLP GYLIKIIAQN PAPLAQLQKF FRIGTLEDVL NIDLYKEKYR
ASLLQTLSTP LLMEYFALKN IYVPLSGVPQ EPNDGQSIDL KIWVEKQLID LETIAVIESE
PGYGKTSFCQ IWAAEVALKL YPHWLPILIR LRDIKYGKSL LETLNSGFTF NVHINLSAWL
EQTNNRCVLL LDGLDELPSS HQGNRDKQIF IQQLLQFQSQ EQHKIVLTSC SHTVEEITSE
IPLQWRTIKI QPLEVNQLKQ WFQQWALFQS LPISQNFFTF LKQAGLFANK SHLSELSHLV
RQPLMLHLLG VFHRDGLLDD EILQKNAKFS LHWEIYSRLK RWLLGYPLTA GMRTMLLRPG
TAHIHRTPEA ITNLLGEYHP QDLIAQMQAI ALKILHGDRH QVTLTGEFNT NTLPALYFRY
CVSSQLPQAN DKAQLTVKVE FSHSQIGEYL CAEAVATQLQ RLTQCQEDVY GTETFIFNTP
SSVAQHLYNL LGYGIITPEI EGLVIAALQT QQKPILLRRL ESFWRGWCQG RWLDESIAHT
ALPYFHSLQN LVNVEQVNTN VGINVFLLLA AICRDIQVAF SPCGNPANVS EFYPQTMMML
LAKASFSGSN TLIKRICSQS LAKINLSGAF LSSVVLTGAN LEQANLCDAV LVNANLAGAN
LNNANLAGAN LAGANLTDVS LETVNLTNAC LCDVPLTEAE REIAQFHGAL FSLEQFQVVK
SLLSKQYYLS ISSTKEKTKF WNQNSLDTGL IESLEGEVIM PTILDHETYD ETVFGKSEL