Gene Ava_C0223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0223 
Symbol 
ID3678022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp257154 
End bp261182 
Gene Length4029 bp 
Protein Length1342 aa 
Translation table11 
GC content46% 
IMG OID637715303 
Producthypothetical protein 
Protein accessionYP_320497 
Protein GI75812880 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0122903 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATTG ACCCAGAAAT CACCAGACAT AAAGAGTGGT TGGGTTTTCT CCAGCCTGTA 
GGGTTGGTGG TATCTCCCCC AGCGTTGGTG AAGGCGCAGG CGGTGGTCAA CCGGAATGTT
GTAGATTTTC AACAATCGTT GCTGGCTGCG GTTGATGAAG ATGGCTTGAT TGCTGATTTT
CCCGCTTTTA CTGTTAATGT ATTAGGCTGG GCTGAGAGTG ACCTTGTTAA ACCATCAGCA
GAATATGAGA TAGCTTTACC TGATTACAGC GAAATTCTTG CGCCTACCTA TATTGTGCCT
GACCCAGATA GCGACAAGCC GCAGATTTTG GTGCAAATAA TTTCTCCTGG TACCGAGTTA
GATGTAGTCG CACCAGAATT AACTAAGTCT AGCAGTGGTT GGCACGCCAG CCCCCAAGCT
AAATTTGAGC GTTTGTTGCG GGAGACGCAA ATACCCATAG GTTTGTTGTG TAATGGCGGG
TTGCTGCGTT TGGTGTATGC GCCACGGGGC GAGTCTTCTG GACATTTGAC CTTTCCAGTG
CAGGCGATGT GTGAGGTTTC TGGAAGGCTG ATTTTGGGGG CGATGGAAAT GCTGCTATCG
GCATTTCGCG TGTTTAGTGC TACTACAGGT CGTTCTTTAA AAAATTTGTT GGAAGACAGC
CGTAAGTACC AAGCGGAAGT TTCTACAACC TTGGCTAACC AAGTGCTGGA TGCTTTGTGG
GAACTGCTGC GTGGGTTTCA AATGGCGGAT GCGGCAGTTG ATGGTAAGTT ATTAAGTGAA
ATTGCTGCTA CAGATCCGCA ACACATTTAC GGTGGATTAA TCACTACGCT GATGCGGCTG
GTGTTTTTGC TTTACGCTGA GGATGAAGGG CTAATGCCGC CAGATGATAT TTACCAGCGT
AACTATTCTG TAACTGGCTT GTATGAACGG CTACGTGAAG ATGCAGGAAA CTACCCCGAC
ACGATGGATC AGCGTTATGG GGCCTGGGTA TGGCTGTTAA GTTTATTTCG CTTAGTTTAT
GATGGGGGCG GACAAACACC AGAGTATTTA CCAGCACGGC ATGGTCAGCT TTTTGACCCA
GATGAATATG CGTTTTTGGA AAGTCGCCCA CGCGGTAGTA AGTTTGTGGC GGGTCAAGCA
ATCGAGCCTC CGCGAATACC CGATGGTGTG ATTTATCGGT TGTTAGAAAA ATTACTAATT
TTAGAAGGGG AACGGCTATC TTATAGATCC TTAGATGTAG AGCAAATTGG CTCGGTATAT
GAAGGCATTA TGGGTTTTGC GGTGGAACGG GCAGAAAGTC CCAGTATTGG AGTTTACAGT
AAACCCAAGG GTTCAAAGGT TTCTACAACG GTGGTAGTTG ATGTAGCAGC CATTTTGGCA
GCGAAATCGG GCGATCGCCA AAAGTTACTG AAGGAGTTAG CAAATTGCGA AGTATCAGGT
AATGCCCTCA AGGAACTGAA AGCAGCGCAA TCATTGGAAG ATATAGCGAT CGCACTAGGT
CGAAAGGTAT CGCGGCAAAC ACCAAATTTA TTGCCTGTGG GTTCGCTGTA CTTGCAGCCA
GGGGAAGAAC GCAGGCGTTC TGGTTCGCAT TATACGCCCA GGTCACTGAC TAAACCAATT
GTGGAAACAA CCCTGCGTCC GGTTTTGGAA GCGTTGGGAG AAAAACCGAC TGCGGAACAA
ATTTTATCTT TGAAAGTTTG TGATTTGGCG ATGGGTTCTG GTGCGTTTTT GGTAGAAACC
TGTCGTCAGT TGGCTGAGAA GGTGGTGGAA GCGTGGGAAC GCGAGGAAAA CGATACCCGC
CTGGGAATTA GCGATCACTC CGCCAGCCAA AGGCATCCCA ACCAAAATGC GACAATTAGC
AATTATGGCA AAGAAGAACC TTTATTAATT GCGCGTCGCT TGGTGGCGCA ACGGTGTTTA
TATGGTGTGG ATAAAAACCC GTTTGCGGTG AATTTGGCGA AGTTATCTTT ATGGTTGGTG
ACGCTGGCGA AGGATTTACC GTTTACATTT CTTGATCATG CCCTCAAGTG TGGGGATTCG
CTGGTAGGGT TGAGGAAAGA GCAGATTGGG TCTTTTGGGA AGGATGCTAC TGACGATTTA
CCGCTATTTA TATATTTAAA AGAGCAACTT GACCGTGCGC GTTCTTATCG GGCGGAAATT
CAGGCTTTGG ATACGCGCAG TGATGCTGAT GATGACCAAA AGCGGGATTA TCTATACAAA
GTAGAACAAG AATTGTACCA AGCGCGGTTA ACGGGGGATG TAAGAATTGC GGCGTTTTTT
GAAGGAAGTA ATAAGAAGCA GCGAGAGGAG AGAGAAACTG AGATTGCAGA ATTAGTAAGA
AAGTGGCGAT ATCACCAAGC TGATACTGAG AGTTTGGAGG AAATTGCTAG TAGGTTGCGG
AGTGGGGATA AGGGGATTAT TCCATTTAAC TGGGATATTG AGTTTCCAGA GGTTTTTGAT
AGAGAAAATC CGGGGTTTGA TGCAATTGTG GGTAATCCTC CGTTTTTGGG TGGAAAGCGT
ATTAGTACAG TTTTAGGCGA TGCTTACAAA GATTGGTTGC CTGTTGTAAA TCCTGAATCT
AACAGTAATG CTGACTTGGT TGCACACTTT TTCCGTCGTG CTTTTGACCT GTTACGTCAG
GGTGGAACTT TTGGATTAGT AGCAACTAAC ACAATTGCCC AAGGAGATAC CAGAAGTAGT
GGTTTACGAT ATATCTGTCA GAACCAAGGC ACGATTTACA ATGCTCAAAA ACGCATGAAA
TGGCCTGGAC AAGCAGCAGT AGTAGTAAGT GTGGTTTATG TGCTGAAGGG AACGTATAAA
GGTATATATT TGCTGAATGG ACGAGAAGTT TCTCTAATTT CTGCGTTCCT GTTTCATGTG
GGGACAAATG AAAATCCGGC AGTGTTGTTG GCGAATAGCA ATAAAAGCTT TATTGGCAGT
TATGTTTTGG GCATGGGCTT TACTTTTGAT GATACCAACG ACGAAGCAAC ACCTATTACA
GAAATGCACC GCTTAATTGA GAAAGATGGA AGAAATGCTG AACGAATATT TCCTTATATT
GGAGGTGAAG AGGTTAACAG TAGCCCAACT CACGCGCATC GTCGCTATGT TATCAACTTC
GGAGAAATGA GCGAAGATGA AGCGCGGAAG TGGTCAGATT TGATAGAGAT TGTTGAGATA
AAAGTAAAGC CTCATCGTGA CACATTAAAG CGAGATGCTT ATCGTAAGCG GTGGTGGCAC
TTTGCTGAAA AACAAGCAGC TTTATACAGA GCGATCGCTC CACTTGAAAA AGTGTTAGTT
GTCTCCCGAC ACCAGCCAAA TTGGTCAGCA GCATTTATGG GAGCAAATGT TGTTTTTTCT
GAAGGCTTGG TAGTTTTGGC TCTTTCACAA TACTCATCAT TTGCTCTCTT GCAATCTCGT
ATTCACGAAA TTTGGGTGCG TTTTTTCGGA TCATCCCTAG AAGACAGACT TCGCTACACT
CCCACAGACT GCTTTGAAAC CTTCCCCTTC CCCCAAAACT GGGAAACTAA CCCCACCCTA
GAAGCCATAG GTCAAGAATA CTACGAATAT CGCGCCGCCT TAATGGTTCG CAACAACCAG
GGACTAACCG ACACCTACAA CCGCTTCCAC GACCCAGAAG AACGCGACGC TGATATCCTA
AAATTACGCT CACTGCACGC CGCAATGGAT AAAGCCGTAC TCGAAGCTTA CGGCTGGAGT
GACATTCCCA CCGATTGCAC CTTCCTGCTA GACTACGACG ATGAGGAAGA CGAAGAAGAA
ACCAGCAACG GACGACAACG CAAAAAACCT TGGCGTTACC GTTGGACAGA AGAAGTGCAT
GATGAAGTTT TAGCACGCCT ACTCGACCTT AACCAAAAAA GAGCGCAAGC TGAAATTCTC
GGCGGTAAAG CAGCACAAAA GAAACCCAAG CCTAAAGCTA CTAAGAAAAA AACAACCAAA
ACCAAATCTA AAAAGGTTGG GGAAACTACA CCAATAATAC CGGGATTTGA TGTGGAGACT
GGCTTATGA
 
Protein sequence
MAIDPEITRH KEWLGFLQPV GLVVSPPALV KAQAVVNRNV VDFQQSLLAA VDEDGLIADF 
PAFTVNVLGW AESDLVKPSA EYEIALPDYS EILAPTYIVP DPDSDKPQIL VQIISPGTEL
DVVAPELTKS SSGWHASPQA KFERLLRETQ IPIGLLCNGG LLRLVYAPRG ESSGHLTFPV
QAMCEVSGRL ILGAMEMLLS AFRVFSATTG RSLKNLLEDS RKYQAEVSTT LANQVLDALW
ELLRGFQMAD AAVDGKLLSE IAATDPQHIY GGLITTLMRL VFLLYAEDEG LMPPDDIYQR
NYSVTGLYER LREDAGNYPD TMDQRYGAWV WLLSLFRLVY DGGGQTPEYL PARHGQLFDP
DEYAFLESRP RGSKFVAGQA IEPPRIPDGV IYRLLEKLLI LEGERLSYRS LDVEQIGSVY
EGIMGFAVER AESPSIGVYS KPKGSKVSTT VVVDVAAILA AKSGDRQKLL KELANCEVSG
NALKELKAAQ SLEDIAIALG RKVSRQTPNL LPVGSLYLQP GEERRRSGSH YTPRSLTKPI
VETTLRPVLE ALGEKPTAEQ ILSLKVCDLA MGSGAFLVET CRQLAEKVVE AWEREENDTR
LGISDHSASQ RHPNQNATIS NYGKEEPLLI ARRLVAQRCL YGVDKNPFAV NLAKLSLWLV
TLAKDLPFTF LDHALKCGDS LVGLRKEQIG SFGKDATDDL PLFIYLKEQL DRARSYRAEI
QALDTRSDAD DDQKRDYLYK VEQELYQARL TGDVRIAAFF EGSNKKQREE RETEIAELVR
KWRYHQADTE SLEEIASRLR SGDKGIIPFN WDIEFPEVFD RENPGFDAIV GNPPFLGGKR
ISTVLGDAYK DWLPVVNPES NSNADLVAHF FRRAFDLLRQ GGTFGLVATN TIAQGDTRSS
GLRYICQNQG TIYNAQKRMK WPGQAAVVVS VVYVLKGTYK GIYLLNGREV SLISAFLFHV
GTNENPAVLL ANSNKSFIGS YVLGMGFTFD DTNDEATPIT EMHRLIEKDG RNAERIFPYI
GGEEVNSSPT HAHRRYVINF GEMSEDEARK WSDLIEIVEI KVKPHRDTLK RDAYRKRWWH
FAEKQAALYR AIAPLEKVLV VSRHQPNWSA AFMGANVVFS EGLVVLALSQ YSSFALLQSR
IHEIWVRFFG SSLEDRLRYT PTDCFETFPF PQNWETNPTL EAIGQEYYEY RAALMVRNNQ
GLTDTYNRFH DPEERDADIL KLRSLHAAMD KAVLEAYGWS DIPTDCTFLL DYDDEEDEEE
TSNGRQRKKP WRYRWTEEVH DEVLARLLDL NQKRAQAEIL GGKAAQKKPK PKATKKKTTK
TKSKKVGETT PIIPGFDVET GL