Gene Ava_2844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2844 
Symbol 
ID3681486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3522416 
End bp3524494 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content47% 
IMG OID637718191 
Producthypothetical protein 
Protein accessionYP_323352 
Protein GI75909056 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.919934 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.438174 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAACGT CTACAAGTTC GCACAAGAGT AGAAAGAGTT CTCACATCAT GAAGCGCAGA 
CATAAAGTAG CCTGTCTCCA GACATCACTC ATCAGTCTCA GCTTAATCTC TGGCTACCTG
ATACCTAATT CTGCAATTGT CGCCGCACCA CCAAGAACCC CAGATAAAAC GGTCAATTGT
GACATTTTGG TGGTGGGTGG CGGACTTTCT GGGGTTGCTA CAGCTTACGA AGGTTTATTG
GCGGGTAGAA GGGTTTGCCT AACAGAAATT ACTGATTGGT TGGGTGGACA AATTTCCTCT
CAAGGAACTT CGGCTTTAGA TGAAAGACCC ACGCAAAGCC AACGTCAATT CTATTCTCGC
GGCTATTTGG AATTGCGTAA TCGCATTGAG AAAAAATACG GTAAGCTCAA TCCTGGAGAC
TGCTGGGTAA GCGATTCCTG TTTTCTGCCC CGTGATGCCC ATACAGTGAT GGTGGAATTG
CTCAAAGATG CAGAAAGGCG GGGTAAGGGT AAGTTGGAAT GGTTCCCCAA CACAGTAATT
AAGGATTTAG AAATTTCTCA GGGTAAATTA ATTAATAGTG CGATCGCTAT TCAACATCAA
CCAGTTAAAG GCGCACCACC CCTCAACACC TTTACCCTAT CACAAACCAT TGATGATGCT
TATCGTTATA GCAACTCATC CCGGTTGACC AAAAGTATTA TTCGCCTAGT TCCCCAACAA
ACTAAAGGAA ATAGTCCCAA GTGGTACATC GTAGATGCTA GCGAAACCGG AGAAATTATC
GCCCTAGCTG ATGTCCCTTA CCGCTTAGGT ATTGATGCTC GTTCTTTTCT GGAACCTTCT
TCCTCTAGTG CCAAAAATGA CCCCTATTGT ACCCAAGGCT TTACTTACAC CTTTGCAATG
GAGGCTACCA AGGAAGCACA ACCCCAGAAA ATGCCCCCAT ATTATTTACA ATATTCGCCC
TATTTCAGCT ACGAGTTACC GAGACTAGCA GATTTTGGCT TAGTTTTTAC CTATCGCCGC
ATTTGGAGTC CGACGAAGGG AGAACCTGTC AATTTCAACG GTGTGAGATT TTCTGCCCCC
ACACCAGGGG ATATCTCTAT GCAGAACTGG ACTTGGGGTA ACGACTATCG CCCCGGTACG
GCCAAAGATA ACTTAGTTTA CACCAGGCAA CAGCTACAAG GAACAGGGCA GTTAAAACCA
GGTGGTTGGA TGGGGGGACT ACGAACCGAA AGCCTCCGCA AAGCCGAAGA AAAAGCCTTT
TCTTACTATT ACTGGTTAGT AGCGGGAACC ACAGATTCCC AATTAGGGGA AGGTGTGAAG
CGTCCCCAAA CTAATAACCG CTTTTTAGCT GGGTTAAACT CCCCAATGGG GACAGCACAT
GGCTTATCGA AATATCCTTA TATGCGGGAA GGAAGGCGCA TCATCGGCCG TCCCAGTTGG
GGACAACCTA CAGGTTTTAC TATTTGGGAA GTCGATATTT CTCGCCGTAA TTATAATGAT
GAATACTACC GCAAAACCTT GCCAGCCGAT ATGTATCGTC AGTTGAGAGC GACATTAGCA
GGTTTAGAGG CGACTTCAGT GATTTCTGGT CAGGTTTCCC CAGATAAAGC GATGCGGCGG
ACTCGTTCCA CCATTTTCCC TGATGCTGTA GGCATTGGTC ACTACGCCAT CGACTTCCAC
CCTTGTATGG AGAAAAGCCC ACCAGAAACA CCCGGAAATA GAGAACGTCC TGGGGAACGA
CGGGGTGCAG GACAAGCCTA TCCCTTCCAA ATTGCCTTAC GGGCAATGAT TCCCCAAAAA
ATTGATAATT TGATAGTCGG GGGTAAAAGT ATTGCCACTA GCCACATTGC AGCCGCAGCT
TATCGAGTGC ATTCCTTTGA GTGGTCTGCT GGTGCGGCGG CGGGAACTGT AGCCGCTTTC
TCCTTAAAAA ATGAAGTTGC ACCTTACCAA TTAGTAGACG ATTTACCCAA ATCAGAACCC
CAATTGCAAG CCCTGAAGCG GCTATTAGAA AAAAATGGTA ATCCCACAGC TTTCCCTGAT
ACTTCCATCT TCAATCAAAA TTGGGAGGAT TGGCGGTAG
 
Protein sequence
MVTSTSSHKS RKSSHIMKRR HKVACLQTSL ISLSLISGYL IPNSAIVAAP PRTPDKTVNC 
DILVVGGGLS GVATAYEGLL AGRRVCLTEI TDWLGGQISS QGTSALDERP TQSQRQFYSR
GYLELRNRIE KKYGKLNPGD CWVSDSCFLP RDAHTVMVEL LKDAERRGKG KLEWFPNTVI
KDLEISQGKL INSAIAIQHQ PVKGAPPLNT FTLSQTIDDA YRYSNSSRLT KSIIRLVPQQ
TKGNSPKWYI VDASETGEII ALADVPYRLG IDARSFLEPS SSSAKNDPYC TQGFTYTFAM
EATKEAQPQK MPPYYLQYSP YFSYELPRLA DFGLVFTYRR IWSPTKGEPV NFNGVRFSAP
TPGDISMQNW TWGNDYRPGT AKDNLVYTRQ QLQGTGQLKP GGWMGGLRTE SLRKAEEKAF
SYYYWLVAGT TDSQLGEGVK RPQTNNRFLA GLNSPMGTAH GLSKYPYMRE GRRIIGRPSW
GQPTGFTIWE VDISRRNYND EYYRKTLPAD MYRQLRATLA GLEATSVISG QVSPDKAMRR
TRSTIFPDAV GIGHYAIDFH PCMEKSPPET PGNRERPGER RGAGQAYPFQ IALRAMIPQK
IDNLIVGGKS IATSHIAAAA YRVHSFEWSA GAAAGTVAAF SLKNEVAPYQ LVDDLPKSEP
QLQALKRLLE KNGNPTAFPD TSIFNQNWED WR