Gene Ava_C0226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0226 
Symbol 
ID3678025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp263282 
End bp266677 
Gene Length3396 bp 
Protein Length1131 aa 
Translation table11 
GC content47% 
IMG OID637715306 
Producthelicase-like 
Protein accessionYP_320500 
Protein GI75812883 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0240002 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAATA CTCCTGCCGA AGTGCGTTCA TGTCTCATTG ATGCCCTACA ACTTGACTTA 
GTGGGGCCTA CGCCCAATGA TATCGCCCAT GTTGACGAAA TTATTGACCA AGCTCCATCT
AAGTGGTATC TCACGGGTTT TTTAGTGCCT TATGAAGCTT CGGTAGAACA GCGTTCAGAA
GATATAGGTA ACGATGACAT AGACGAGATT TCTCAGGTGA ACGCTGGCGA TGATGAAAAG
CAACCCGACA GTGCCTCCGC CCGGAGAGCA TTTTTCCCCT CATCAATGGG TTTAAGCATC
CTTGTACCAG CAACTGCTAA GGAAATAAAT GTTACTGTTC ACTGGGGCGA TTACAGTCCG
GTTGATGAAG AGGATGAAGA AAATCAAGAA GACTCTAAAA GCCAACTCCA GTTGCCCCGG
CTGTGGCAAA GAACCCCCGG ACAAGCAGAA TTAACTGTTC CTCTCCATTT AAGTAATGTC
CCCAAACATT GGGAAATTCC TGGTAGTAAC GGTTTGAGAC TCGTTACCTC AGTGCGTCCC
GTAACTGCGG CGGAGTTAGT TCCTGTTGGT ACTCGTTCCG TATCCGTATT CCTGGTGAAC
TATCGCCCGC CAGTAGCCAA CCCCTACAGC GATATTGCTT TCGCCTTCCA AACTTGCCTC
ATCATCCGCA CCCCTTCTTC TCTCGTTCCT CGTCCCAACC TGCGCGGACG ACACGGTGAT
GATTGGGATG AGAAGGTAGC AGATTTACAG TATCGAGATG ATTATGAGTA CGCAGTGGGA
CATAACGTTT CAGCTGTCGC TGTCACTAAC GATGATGCTA CTTGTCAAGA AGTCCGCACT
GCTTGGATGC CCATTGCTGA TGTAGAAAAG GTAGTACCTG AAAAAGTTCC AGGCGTGGAA
CTGGGAATGG AAGCACTCGC CGCCGCACCC ACGGTGGAAA CTCTGCGAAA TATGATGTCT
GGGATAGTTG ATGCTTATAG GGTGTGGATT GAAGCACAAA AGCTAAATCT TCCTAATGAT
CCCGAACGAC TGGAAGTTGC TAACGATTTA CTCAACCGGG CAACCAGGGC AAACAAACGC
ATCGCCGCCG GATTAAAAGC ACTAGATGAT CCTAATGTAT TAGAAGCATT TCAAATTGCC
AATCGCGCGA TCGCCACTGC TATCCGTCAG CGTCTTACCC ACAATACAGA TACTACCCCA
GAATCAGTCA AGCCTCCGGC ATGGCGACCT TTTCAGCTGG CGTTTTTATT AATGAACTTG
GTTGGTATCG CCTATCCCGA ACACCCTGAC CGAGAATTAG TAGACTTGCT GTTTTTCCCC
ACAGGCGGCG GTAAAACTGA AGCTTACTTA GGATTAGCAG CATTCGCAAT GGTATTGCGC
CGCCTGCGAA ACCCCACAAT AAACTCAGCC GGCGTAAGTG TCTTGATGCG TTATACCCTG
CGCCTGCTTA CCCTTGACCA ATTAAGCCGT GCTGCGACTC TTGTCTGCGC CTTAGAGTTA
GAAAGACAAA AAGATACACA AAAATTAGGC CCCTGGCCTT TTGAAATTGG ACTATGGGTA
GGACAAACCG CTACTCCCAA CCGTATGGGA AAAAAAGGCG ACAACGATGA ATACACCGCC
CGCGCCCGTA CTATTGCCTT TCAAAATGAC ACCCGCAAAC CTTCACCCAT CCCTTTAGAA
AACTGTCCTT GGTGCGGTAA ACGGTTTACC TCTGACTCCT TCCAATTACT CCCAGATGCA
AATCAGCCAA AATCTCTGCA AATTACTTGT ATCAACCGAA AATGCAAATT TACCCGCAAT
CAATCGTTGC CCATCGTTGC CGTAGATGAA CCAATTTACC AACGATTACC CAGCTTTATT
ATCGCCACTG TCGATAAATT CGCCAATCTC CCTTGGGTAG GAGAAACTGG GGCATTATTT
GGACTGGTAG ACCGTTATGA CAAAGATGGT TTTTACGGGC CTGCCCATCC CGGTCGCGGT
CAAGCACTTG CAGGTCATTT ACCAGCCCCA GACTTGATTA TTCAAGACGA GTTGCACCTA
ATTTCCGGCC CCTTGGGAAC AATGGTAGGG TTATATGAGA CTGCCATTGA CGAACTGAGC
AGCAGAGAAA TTAACGGTAA AAAAATACGC CCTAAAATTA TTGCTTCTAC CGCAACAGTA
CGGAGAGCTA GTAAACAAAT TCGAGCCTTA TTTGGTCGAG ATGCTGTAGA TATTTTCCCA
CCTCCCGGCC CCGATCGCCG CGATTCATTT TTCGCTAAAA CAGTACCAGC AAGTGAAAGT
AATGCCCGTA CCTATGTAGG CATTGCGGCC CAGGGACGAA GCTTAAAAGT GGTACTGTTA
CGAACTTACT TAGCACTACT GGGTGCTGCA CAGAAACATT ATCAAGCAGC CGGAGGAGCA
AAAAATCCTG ATAACCCCGC AGATCCTTAC ATGACCTTGC TGGGATATTT TAACTCCCTA
CGCGAACTAG GTGGTAGTCG CCGCATCGTT GAAGATGAAG TCAACTCTCG TTTAGCAAGG
TATAGCCTGA GAAAACGAGT CAACGAAACT GAAGGTTTAT TTGCTGACCG TCAAATTGCC
TATGAACCGG CGGAATTGAC TTCCCGTGTC AGCACTAATG TTGTTGCTGA AATCAAAAGC
TGCTTGGCAC TACCATTTCA CGAAAAGAAA CATATCGATG TAGCCTTAGC AACAAATATG
ATATCCGTGG GTTTGGATAT CACCCGTTTA GGGTTAATGG TCGTGTTGGG TCAACCAAAA
ACAGCATCCG AGTATATTCA ATCTACCAGT CGGGTGGGAC GGGATGAAAA TCGCCCTGGT
TTGGTGATTA CATTATTAAA TATACATCGA CCACGCGATC GCTCTCACTA CGAACGCTTC
CCAGCTTGGC ATACCAGCTT TTATCGTTCT GTAGAAGCAA CTAGCGTCAC TCCATTTTCA
CCTCGTGCTA TTGACAGGGG TATTGCTGCT ATTTCCGTTG CCTTGGCGCG TTTAGGACAT
CCTGGCATGA CTGCACCACC CCGCGCTATC GAGATTTTAC AACATCGGCA GGATTTAGAA
TATGTTGTCG ATGCCATTAG CGATCGCGCA GAAATGCACG ATAAAGAACT TGATGCCGTA
GAAGCCGAAG CACTACGTCA AAAAGTTCGC GGACGGGTGA AAGATTTACT AGACACTTGG
GAACGCATTG CTAGTCAAAA AATCAGCTTG CAATACCAAC AAGAAGTAGG TCAAGCACCG
CCATTATTAT TCGACCCCCT TGACCCAGAA CTTGAAAAGC AACCAATGGA AGCACGCAAG
TTCAAAGCAC AACGCAGCCT GCGAGATGTG GAACCAACAG TCAACTTGTG GGTTTGCAAC
CCTGATGGTT TTGAGGTTGA GGAGGACGAA AAATGA
 
Protein sequence
MPNTPAEVRS CLIDALQLDL VGPTPNDIAH VDEIIDQAPS KWYLTGFLVP YEASVEQRSE 
DIGNDDIDEI SQVNAGDDEK QPDSASARRA FFPSSMGLSI LVPATAKEIN VTVHWGDYSP
VDEEDEENQE DSKSQLQLPR LWQRTPGQAE LTVPLHLSNV PKHWEIPGSN GLRLVTSVRP
VTAAELVPVG TRSVSVFLVN YRPPVANPYS DIAFAFQTCL IIRTPSSLVP RPNLRGRHGD
DWDEKVADLQ YRDDYEYAVG HNVSAVAVTN DDATCQEVRT AWMPIADVEK VVPEKVPGVE
LGMEALAAAP TVETLRNMMS GIVDAYRVWI EAQKLNLPND PERLEVANDL LNRATRANKR
IAAGLKALDD PNVLEAFQIA NRAIATAIRQ RLTHNTDTTP ESVKPPAWRP FQLAFLLMNL
VGIAYPEHPD RELVDLLFFP TGGGKTEAYL GLAAFAMVLR RLRNPTINSA GVSVLMRYTL
RLLTLDQLSR AATLVCALEL ERQKDTQKLG PWPFEIGLWV GQTATPNRMG KKGDNDEYTA
RARTIAFQND TRKPSPIPLE NCPWCGKRFT SDSFQLLPDA NQPKSLQITC INRKCKFTRN
QSLPIVAVDE PIYQRLPSFI IATVDKFANL PWVGETGALF GLVDRYDKDG FYGPAHPGRG
QALAGHLPAP DLIIQDELHL ISGPLGTMVG LYETAIDELS SREINGKKIR PKIIASTATV
RRASKQIRAL FGRDAVDIFP PPGPDRRDSF FAKTVPASES NARTYVGIAA QGRSLKVVLL
RTYLALLGAA QKHYQAAGGA KNPDNPADPY MTLLGYFNSL RELGGSRRIV EDEVNSRLAR
YSLRKRVNET EGLFADRQIA YEPAELTSRV STNVVAEIKS CLALPFHEKK HIDVALATNM
ISVGLDITRL GLMVVLGQPK TASEYIQSTS RVGRDENRPG LVITLLNIHR PRDRSHYERF
PAWHTSFYRS VEATSVTPFS PRAIDRGIAA ISVALARLGH PGMTAPPRAI EILQHRQDLE
YVVDAISDRA EMHDKELDAV EAEALRQKVR GRVKDLLDTW ERIASQKISL QYQQEVGQAP
PLLFDPLDPE LEKQPMEARK FKAQRSLRDV EPTVNLWVCN PDGFEVEEDE K