Gene Ava_0605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0605 
Symbol 
ID3678538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp760191 
End bp762374 
Gene Length2184 bp 
Protein Length727 aa 
Translation table11 
GC content46% 
IMG OID637715933 
Productpentapeptide repeat-containing protein 
Protein accessionYP_321124 
Protein GI75906828 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000848072 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.230726 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACTC CAATTGTTAA AAAAAGTAAT AATCCACCAA GCAAAAATTT AGCAACACCC 
AATTCACTAC CCCTAGCTAC AAGGCGCTTG GCCGCTTGGG CAACGGAAAT TACCTTATTG
GCTGCCACCG GTTTGGTTCC TTTTGGTCTA GGGGCGTATA TCAATTCCAG AAGTGATATT
AATCGAGAAC CCCTCAACCC AGGATTAGTA GTTGTAGAAA GAGCGATCGC CAGACCTTTG
GCATTACCGG CAGACTATGG TGTACGAAAT GTAGCTTGGC CGACTAACTA CCTATGGATG
TTAGCTTTGT TAGCACCCAC AGCTCTTTCT TGGTGGCAAT TATACTTACT AGCAAAAACG
GGTAGTACTC TGCCTAAGCG TTGGTTTGGG GTGAAGGTAG TCAATGAAGA AGGTACTCCC
CCAGGTTTAG CCGCCGTTGT CGTCCGTGAA GGTATTGGTC GTTGGACTGT ACCCATGTCT
GTTGCTTACA TTCTGTGGCG CTACAGTTTT GCTTTTCCCA ATTTGGGCTT GTTTACATCA
TTGGCAGTGT TAATGGTCAT AGGTGAGGCT TTGGCTTTAC CCGCCCGTCG GGGACGGAAA
GCCTTACATG ATTGGTTGGC GGGTACTTAT GTAGTCGATG CTAATCGCCC TGTAGCATCC
CCAGATTTAG CCCCCAGTGG AGGAGGTTTA TCTGGTATCA GTCCTCAACC TGAAGAAGGG
AATACTGCCT TAGCCACAAC GGCAATGGCT ATGAGCTATC CCCAAGCAGA AGTCATCACC
ACGGACAACA GTAACTTGAT TTCCTTGTGG CGACGGATGC AGCAAAACCC CAGCCTCACC
TTATTTGGTG TTGCCCTCAC CAGTATGACG GCTGTACTAG CTACTCTAAT TGGGACTCAA
GTTTATATTC AGACTCAGCA AGGGAATCGG GAATCGCAGA AAATTAACAG TCAGCAGTTC
TTGGAACTAG TGAAACAATT AAGTCCTGAG TCTGGAGCCA GCATTGAAGA CCGTCAGAGG
ACAATTTTGG CTTTGGGTAG CCTGAAAGAT TTCCAATCTA TCCAATTTTT GACGGACATG
ATGGTGAAGG AAACTAACCC TATTCTCATA GATACCATCC AACAGGCACT CACCAGCGTA
GGCACCGCCG CCATCCCCGA ATTACAAAAC AAAAATCAGT TTTTGGCGAC AGAATTAGAC
TCTGTTGGTA GCGCATCCCC AGAACGGGAA GTTCGCCAAA AACGTTTACA AACTAACCAG
CAGACAATTA ATAAAATTCT CAATGTTTAT GGTGGTAAAA CTTTAGGCCT TGACCTGAGT
CGGACTCAAC TAGGCCAAAG CGGGACTGTG GGTGGTTCGT TTTTTAACTT GGTTTTAGAC
AATATTGATT TATCAGGCAT TAAGTTCAAA TCTGCCAATC TTAACCAAGC TAGTTTTAAG
GGTAGCCGTT TTCGCAGTGT CGGTGATGAT GGGCGCTTGG ACACCTATGA TGATGCGATC
GCTGATTTAA GTCAAGCCCA GATGAAACAA GCCAATTTCA CTGATGCTAA CCTCAGCCGC
GTCCTCATGA CTCGTAGCGA TTTAAGCCGC GCCACCCTCA ACAGAGCTAA TTTATCTAAT
GCACGCTTGA TTGGTGCTAA CCTCAGCAGC GCCCAATTAG TAGGAGCTGA TTTGCGGGGT
ACAGTTTTAG AAAATGCCAG CTTGACAGGG GCTGATTTAG GTGATGCTAA ATTACAAGAA
GCCAACCTCT ACGGTGCGCG TCTTAGTCGA GTTATCGCCA TAGGCGCTCA ATTATCCTTT
GCCAACTTAA CTAAAACTGA TTGGCAAAGT TCCGACCTCT CCGGCGCTGA TTTAGAACGG
GCAAATCTCA GCAATGCTGA CCTCAGCGCC ACTCGCATGA CAGGGGCAAT CTTACGCTCT
GCTCAACTAG AAAACGCTAA CCTACGCAAT GCTGATTTAA GTTTGGTCGA TTTGCGGGGA
GCTAATGTCG CCGGTGCTGA TTTTAAAGAC ACAATTCTCA CACCCAACAA ACAAGACCCA
GCAGACCAAT TCGTACAAAC CCCAGAATTA GGTTCTGTAT CTGCGGTAGT TAAAGGGGTA
GATTTTTCTC AGGCTAAAAA TCTGGATGGC AAACAACTAG CTTACATTTG CACTCAAGGC
GGTATTCATC CACGTTGCCC GTAG
 
Protein sequence
MTTPIVKKSN NPPSKNLATP NSLPLATRRL AAWATEITLL AATGLVPFGL GAYINSRSDI 
NREPLNPGLV VVERAIARPL ALPADYGVRN VAWPTNYLWM LALLAPTALS WWQLYLLAKT
GSTLPKRWFG VKVVNEEGTP PGLAAVVVRE GIGRWTVPMS VAYILWRYSF AFPNLGLFTS
LAVLMVIGEA LALPARRGRK ALHDWLAGTY VVDANRPVAS PDLAPSGGGL SGISPQPEEG
NTALATTAMA MSYPQAEVIT TDNSNLISLW RRMQQNPSLT LFGVALTSMT AVLATLIGTQ
VYIQTQQGNR ESQKINSQQF LELVKQLSPE SGASIEDRQR TILALGSLKD FQSIQFLTDM
MVKETNPILI DTIQQALTSV GTAAIPELQN KNQFLATELD SVGSASPERE VRQKRLQTNQ
QTINKILNVY GGKTLGLDLS RTQLGQSGTV GGSFFNLVLD NIDLSGIKFK SANLNQASFK
GSRFRSVGDD GRLDTYDDAI ADLSQAQMKQ ANFTDANLSR VLMTRSDLSR ATLNRANLSN
ARLIGANLSS AQLVGADLRG TVLENASLTG ADLGDAKLQE ANLYGARLSR VIAIGAQLSF
ANLTKTDWQS SDLSGADLER ANLSNADLSA TRMTGAILRS AQLENANLRN ADLSLVDLRG
ANVAGADFKD TILTPNKQDP ADQFVQTPEL GSVSAVVKGV DFSQAKNLDG KQLAYICTQG
GIHPRCP