Gene Ava_4849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4849 
Symbol 
ID3679347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6105041 
End bp6107962 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content42% 
IMG OID637720206 
Productpentapeptide repeat-containing protein 
Protein accessionYP_325341 
Protein GI75911045 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.692365 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0232391 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAAGC GTTTCTGGCA AACTTGGCAA GAATTTAGGC AGTCATTTTC AGTATCAGAA 
AGCCTAAGTA CAAGTGTCGA GACAGGTAAG GCAGTCTTGG AAGCTGCCAA TACTCTCAAA
GAAGAGGGTG ATAGTATAGA AATCTTGCAA TCAGTCCTGC AAAACTCATC TTCTTTATTA
GATGTGTTGT GTTCGCCAAT GGCGCAGGTT ATAGGTGCGG GGTTGCCCTT TGTACCAATT
GGCATTGCTT TGTTAAAGTT TGCCCGTGAT ATAAATCAGA AAGAACCATC CTTAGAAGAC
TGCTTTTTTA TAGTTAGTCA AGCGGCTTAT TTGGAAAGCA CCAAAGAAAT TTTAAGTTTA
AATATATATC AAAATTTTAA CTGGGATGCC AAGCTTGATA TTCAAGCCAT CAGTCAGCAA
ATAGAAAAGC TCAATGATGT TGAATTTAAT TCTGATACTG CGAGTAAAGC AATCAGATGT
TTCCATGAAT CACCTTTAGC TGAAGCTTTC AACAGGGTTT TATTAGCTAG ATTAGCAGCA
GCAAATATTT CCCCAGGACT AGCGGACATT TTAACCCAAC GTGTGGCTCG GAACACTCAC
CGCCATATCA TTAAAGCTTG GATAGAAGCG GGTGAGGCGA TAAAAACTTT AATTCAGCCT
TCCTTGGGTG ATTGGCAGAG AGAACAGGAA AGGTTTCAGA GTATTGATAA TTATTTAAAA
ACTCATATTG AGCAGAAACC TTTTGAGTTA GTTTTTGATG AAAAGTTCGC CTTTAAAGAT
ATTTATGTAC CTATCAAAGC CAAGCCTGTA GATGCAAATG GCAAGATAGA TGAAGAAAAG
GATTCATTCA ATTTAGACAC TTGGGCGAAA ACGATTCTGT TGAATCCTGA TAACTTGGAA
CAGGTGATGT TTATCCAAGG AGGCCCAGGC AGGGGGAAAA GTGTTTTCTG TCGAATGTTT
GCTTACACAG TGTGGCGACA GTTACACCCA ATTTGGACAC CAATCTTAAT TCGCTTGAGA
GATATCGACA CTTTTGAAAC ACGGTTAGAG AATACTATTA AAGCCGAATT AAAACTGGGT
TTTATTCAAG GTGATGCTAA CTGGTTGACT AATGCCAATA CTCGGTTTTT GTTTATTCTT
GATGGTTTTG ATGAACTGCA TATTGAAACG AGAAATAACC TCAATTTGGG CGATTTTATT
AAACAAGTAG CTGGTTTTCA GAAAGAATGC AAAGATTATA GGGAAATGGG GCATCGAGTT
ATCATTACTG GTAGGTCGAT GGCTTTACAA GGTATTGCTG ATTTACCCCG CAATTTGGAA
CGGGTGGAGA TTGTGGAAAT GGATGGGCAA CTCCAACAGC AATGGTTAAA TAAGTGGGAA
GCTGTACAAG TAAATAAAGG TAAAACCATT GCGTTTGAGC AGTTTTTACA AAGTGATAAA
TGCCCCGATG AAGTTAAGAA ATTAGCTCAG GAACCGCTAT TGCTCTATTT ATTAGCGGCA
ATGTATCGAG ATTCTAAATT AGATATTCAT AAGTTAGAGC AGGCAAGTGA TAATCGCACC
GCTAAAATTA TCATTTACCA AGAAGCTGTA AATTGGGTAC TGACTAAACA GCGTTCTGAA
CCAGATGGAA CTGATTTAAA TATTGAGTTA ACTAAACAAA AGCCTGAGGA TTTAAAACGC
ATCCTCATGG AAGCTGCGGT TTGTGTTGTG CAGTCTGGTG GTGAGTTTGC TTCTATGTCC
ATGTTAGAAG CACGTTTACA AGAAGATGAA GGAGCGAAGG CTTTAATTGA AAAAGCGAAA
GAAAAACTGG GGAATGAAGC ACTGAAAACG GCTTTGGCTG CATTTTACAT TCGTCCGGCG
GAAAAACAAG AAGGTGGGGT TGAGTTCTTC CATAAAAGTT TTGGGGAGTT TCTCTTTGCT
GAACGCCTGA AAGCACGGCT TAAGGCTTGG ACGCAATATT ACGATGGGGA TGAGGGGAGA
CAGCCAATTA TTTCTGAAGC TGTAATGAAT TGGGAAATCT ATGATTTACT CGGTTACGGT
GGGTTAACAC AGGAAATTGT AGACTACCTG ATGGGGTTGT TAACTGAGAG TCAAGATTTC
CGGTGGGTGG AATTATTTAA GCGGTTAGAC AAGTTTTATA GTAAGTGGTG TCAAGGGAAA
TTTATTGACA CATCTGAGGA GACTTTACCC CAGAAGAAGT TGCGACAGTT GCAAAGGTAT
GGCATTCAAG GGTTAGGTCA GCGTCAGGTG GATGTTTATG CTGGGTTGAA TGTGATGATT
CTGCTGTTGG AGTTACACCG CTATGCTCAA GGGCGAGATG AGCTGAAAGC AGAAATTGTC
TTTTATCCAT CTGGGAAACC GCAGGGACAT AGGCTCACAG CCCGATTGCT TCGCATCATG
AATTATAGTG ATGGGTTGGA TTTAGGGAAC TTTATTAGGA TAGTTGGCAA ATTCCTCAGA
GGCGCAGACC TCAGTGGCGC AGACCTCAGT GGCGCATTCC TCAAAGGAGT ATTCCTCAGA
AGCGCAGACC TTAGTGGTGC ATATCTCAGA GGTGCAGACC TCAGAGATGC ATACCTCAAT
GGCGCAGACC TCAGTGGCGC AGACCTCAGT GGCGCATACC TCAATGGCGC ATACCTCAAT
GGTGCATACC TCAATGGCGC ATACCTCAGC CACGCAGACC TCAGTCGTGC AGACCTCAGA
AGTGCAGACC TCAGAAGTGC AAACCTCATT AGTGCAGACC TCATTAGCGC AGACCTCATT
AGCGCAGACC TCAATGGTGC AGACCTCAGT CACGCAAACC TCGGTGATGA ATTTTGGGGA
GATGTCAAAT GGGATGAAAA GACAAACTGG GAGAATGTAC GAGGGCTGGA TACAGCGATT
AATGTGCCAG AAGCGTTAAA GCGACAGTTA GGACTGAGTT AA
 
Protein sequence
MGKRFWQTWQ EFRQSFSVSE SLSTSVETGK AVLEAANTLK EEGDSIEILQ SVLQNSSSLL 
DVLCSPMAQV IGAGLPFVPI GIALLKFARD INQKEPSLED CFFIVSQAAY LESTKEILSL
NIYQNFNWDA KLDIQAISQQ IEKLNDVEFN SDTASKAIRC FHESPLAEAF NRVLLARLAA
ANISPGLADI LTQRVARNTH RHIIKAWIEA GEAIKTLIQP SLGDWQREQE RFQSIDNYLK
THIEQKPFEL VFDEKFAFKD IYVPIKAKPV DANGKIDEEK DSFNLDTWAK TILLNPDNLE
QVMFIQGGPG RGKSVFCRMF AYTVWRQLHP IWTPILIRLR DIDTFETRLE NTIKAELKLG
FIQGDANWLT NANTRFLFIL DGFDELHIET RNNLNLGDFI KQVAGFQKEC KDYREMGHRV
IITGRSMALQ GIADLPRNLE RVEIVEMDGQ LQQQWLNKWE AVQVNKGKTI AFEQFLQSDK
CPDEVKKLAQ EPLLLYLLAA MYRDSKLDIH KLEQASDNRT AKIIIYQEAV NWVLTKQRSE
PDGTDLNIEL TKQKPEDLKR ILMEAAVCVV QSGGEFASMS MLEARLQEDE GAKALIEKAK
EKLGNEALKT ALAAFYIRPA EKQEGGVEFF HKSFGEFLFA ERLKARLKAW TQYYDGDEGR
QPIISEAVMN WEIYDLLGYG GLTQEIVDYL MGLLTESQDF RWVELFKRLD KFYSKWCQGK
FIDTSEETLP QKKLRQLQRY GIQGLGQRQV DVYAGLNVMI LLLELHRYAQ GRDELKAEIV
FYPSGKPQGH RLTARLLRIM NYSDGLDLGN FIRIVGKFLR GADLSGADLS GAFLKGVFLR
SADLSGAYLR GADLRDAYLN GADLSGADLS GAYLNGAYLN GAYLNGAYLS HADLSRADLR
SADLRSANLI SADLISADLI SADLNGADLS HANLGDEFWG DVKWDEKTNW ENVRGLDTAI
NVPEALKRQL GLS