Gene Ava_D0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_D0044 
Symbol 
ID8952431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_014000 
Strand
Start bp31145 
End bp33658 
Gene Length2514 bp 
Protein Length837 aa 
Translation table11 
GC content49% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003541167 
Protein GI292905296 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00826544 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAC CAGGAACAGA AGTAAAAATC ACCAAAGGCG ACCACAAGGG GAAACGCGGG 
AAAATTGGCT CTAACCGTCC TAGTAACTCC CTGCAAAAAG GTAAAGTCCC TGTAGTCGTT
GGCAATAAAA ATCAAGTCAT CTGGACTCGC CCCGACTGGG TTTCAGAAAT CATCTCCTCC
AATCCCCATT ACCCCTTATC CCCAGAGGGG GCCTCACCTT CCCCAGTCCC CAATCCCCAA
TCCCCAATCC CCAATCCCCG ATCCGAAACA GAAAAACGCA TCCTGGGATA CCTAAAGACC
CGAATAATTG CTCGTTCCCA AACAGAGATT GCGATCGCAC TTGAAGCCTC CATGTTGGAA
ATTGGCATGG CTCTCAAGGA CCTAGAGCGA GACGGACTAG TTTGTAACAA CGGTGCAGAT
TATTATCACC TGCCAAAATT TCAAGTTGGG CAGTACGTTA CAGACGGAAC ACGCACCGGG
TTAATCAGCA GTGAAGACAA AAACATTTTC ACCATCACTT TTGGCGAAGA TGAACTGCAA
TGCACCATTG AAGACTTAAG GGAAAATTTC CAAGTATGCG GCAACCAATT AGAACTATTG
GACTCTTTGA AGGCATCGGC GGATTCTGTA GAGCAGCCGA ACTCATCGGC GGATTTGAAT
GGGTTGAGTC AGTCGAACTT GATAACGATG CCGTCCAAGT CCTCAGAGAT AATTTTACCC
ACCACATCTA CCACGGCGAC ATCCGAGACT ACCACCCCCA ACCAGGAGCC GCCGACCTCT
ACACAATCGG GTTCCCCTGC GACAACACAA GTAACGCCGG CGACAGAACA GGATTACTCG
GAGACAAATC AGGACTGTGG TTGGAAGCGC TCCGTTGCGT TGTTGAGGGA TTACCTAAAT
TCGCCATTAT CGAGCAGCCA GAAGGAATTA TTCATAGAGG CCTGCGAGGA ATCCTTGGCG
GACTGCGAAT GGCAGGATAT CAGTGGGAAG ATCCGATCCT CCTACCGAGC GCTAGTGTTG
GAGCAACGCA ACGCCGCACC AGACTTTTTA CAATTGCCTA CCTTAACAGC CTTGGATGGG
AAAACTTCCC GACCGGGTGG AACGACCAAG TGCGATCGCA CTGCGAGGAA GTTAGGGCTA
ATTACAGATT CCCAACTATT GAGCGTAGAG GCGATGGCAG CCATCTCTGG ATTCCCGACG
AACTGGACGA AGTGCCTTTC GGGGTTGAAC CTAGAACTCA ATCAGGAAGA CTACAATCAA
GAATCCTTTA CGGCAGAACA GTCATCCCCC AGCAAGCTGC AATTGCCCTC CAGCGAGTCA
AATACCTGTG GGAAACAATG GATCGACCCC AAGAAAATCA TTCTCAAACA TGGGACACAG
TATCGTTACT ACGTTGATGC AGCCTACGCC GCTAATGTCA AAGTAGTGCA GGGATTTGCT
GAAGTAATGC AGGAAGATTT ATGGGATTGG TATCGTGAGC CTTTGCCTGT AGCCTTCCTT
TCATCTGAGG GGAAAATCTA TGTAGGCGAT GGGCATCACC GGGTATCAGC CGCCCATACT
GTTAAAAAAC AAATTTACGT TGACCTGCGA CCGGGCGAAC TCGTAGATGC TATTTTATTT
AGCTGTCAGT CAAACACTGA CCACGGCTAC CAACTCCGGG CTAAAGACCA ACGTAAACGG
ATAGAGATGT TCTTGGATAC TCTGGATGGG CTGGATGAAG TGCGATCGCG CCAACTACTA
GAATCCGTTC CTGGTTTAAG CGAGATTGAG CGCCGTAACT GCCAGGGTGG TAAATGGTCA
GCGCGGGTAG TTGCTAAATA TTTACGGTTG ACAGAATCCG GCTACCGCAC TGTTATCAAC
ATCCAGCAAG AACGGGAAAT GGTCGGGTAT TTTTCCCAGT TCAGTGAAGG TGACTGGGTG
CGGGTACAAG AGAGTATCGC TGATGGGACT GATTTCCCTT GGGGGACAAT CGCCCAAATC
CAAAACCTTG ATAAGCGTAA AGGTGTTTTC ATTATTCCTT TACCCGGTGC AACTGATAGG
GAAGGAAGAA TATTGCCGAG TGGTTATATT CATCCTCGGT GCTTAGAAAA AACCGAAGCT
CCCCAATTGC TAGATACTGG GAAAATTACT CAACCAGAAG CTATCACCTC AATCGAGCAA
GAGGTGGAGC AGCAAGCAAC CGAGTTAGGG TTGAGTAATC GCTCACAGAT ATTACCTGAC
GTAGAACGCA ACCAAGGCGA ACCCCATAGC CTAGTAGATG ACAGACCTTT TACTGGTGAA
GGCGATCAAC TCAGTACCTG GATTAATGAT CTACCAGAAG AAAATTTCCA AAAGGTATGG
AGTGCGATTA ACCAAAGGCA ACCTGAAGAT GTTAGTGGTT TTGTCAAACA CCTTTCTGAT
GAGCAACTTT GGAGTGCGAT CGCATCCCGA AGCGATGAAA CCTTAGAAAA ACTAATCGCT
TGGGCAACCG AGATTTTAGA ACAGCGCCGG GAGGAAATGC CCCATGCCTC GTAG
 
Protein sequence
MMKPGTEVKI TKGDHKGKRG KIGSNRPSNS LQKGKVPVVV GNKNQVIWTR PDWVSEIISS 
NPHYPLSPEG ASPSPVPNPQ SPIPNPRSET EKRILGYLKT RIIARSQTEI AIALEASMLE
IGMALKDLER DGLVCNNGAD YYHLPKFQVG QYVTDGTRTG LISSEDKNIF TITFGEDELQ
CTIEDLRENF QVCGNQLELL DSLKASADSV EQPNSSADLN GLSQSNLITM PSKSSEIILP
TTSTTATSET TTPNQEPPTS TQSGSPATTQ VTPATEQDYS ETNQDCGWKR SVALLRDYLN
SPLSSSQKEL FIEACEESLA DCEWQDISGK IRSSYRALVL EQRNAAPDFL QLPTLTALDG
KTSRPGGTTK CDRTARKLGL ITDSQLLSVE AMAAISGFPT NWTKCLSGLN LELNQEDYNQ
ESFTAEQSSP SKLQLPSSES NTCGKQWIDP KKIILKHGTQ YRYYVDAAYA ANVKVVQGFA
EVMQEDLWDW YREPLPVAFL SSEGKIYVGD GHHRVSAAHT VKKQIYVDLR PGELVDAILF
SCQSNTDHGY QLRAKDQRKR IEMFLDTLDG LDEVRSRQLL ESVPGLSEIE RRNCQGGKWS
ARVVAKYLRL TESGYRTVIN IQQEREMVGY FSQFSEGDWV RVQESIADGT DFPWGTIAQI
QNLDKRKGVF IIPLPGATDR EGRILPSGYI HPRCLEKTEA PQLLDTGKIT QPEAITSIEQ
EVEQQATELG LSNRSQILPD VERNQGEPHS LVDDRPFTGE GDQLSTWIND LPEENFQKVW
SAINQRQPED VSGFVKHLSD EQLWSAIASR SDETLEKLIA WATEILEQRR EEMPHAS