Gene Ava_D0033 details


Gene Information

Locus tag: Ava_D0033
Symbol:
ID: 8952420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_014000
Strand:
Start bp: 22354
End bp: 24309
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003541156
Protein GI: 292905285
COG category:
COG ID:
TIGRFAM ID:
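
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 22354
end_bp = 24309
gene_length_bp = 1956
protein_length_aa = 651


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# 24309 - 22354 + 1 = 1956 bp, and 1956 / 3 - 1 = 651 aa.
print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Both checks agree with the record: the gene sequence ends in the stop codon TGA, which accounts for the one codon not represented in the 651-residue protein.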


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.417677
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTCAAG TCAGGCGTGA TACTAATCGC GGTACTGGGG TATGGAGTGG GGTGGTTCAG 
ATAGCAGCAG CCCACACTTT GATTATTGTG GGGCTGTATG AGCCAAGTGT CTTGTTGCCC
ACCACGGCTA TTGGATTTTT TATTTCATCT TGGGTACTGG GAAAATTTCC AAATCCCTTA
CCCAGAAAGA ATGAACTCAT TGCCGCAGCG CACCGGGAAA ATTGGCGGAC ATCGGCTTTA
ACCGAGGTGG AAAAATCGGC TCAGTCTAGG CTGGAAGAGA TTGAGCAGAA GGCTGGCGTA
TTGGTAGAGA TGCGATCGCA ACTGCAAACT AAAAGCACTC AGTTGGCGCA ATGGGAACAA
AATCTATTGC AAGCTGAGGG GAAATTCCGC GAACTGCTGC AACAGACCGA AGACCACTAC
AAGGAATTAC TGCAACAGCA AGCCGAACAG TACATGGCGG AGATACGGCT GCGGGAAGGG
ACTATCGTTG GCTTGCAGCA AAAAATCATG AGAATGGCAA ACAGGCCAGA CCCCAAGCAA
GGGTTCGCGG CCTGGGTTGC TCAAATGCTG CTGGACGCTT TGGAACAAAA CCAAGTTTAC
TGCCGGATGG TTGCTTTTCA AAAAGTCCCC GGTGTGCGTG AGGTTAGCGT CTGGGTGGAG
TTGGAACCAA TTGCGGTTGA ACCGGGTTAC GCCCGGAAAC TGGAGGGGTT ATCTAAAGAG
GTAGCATCGC TGGTAAAGCT GGGTGAGCCG GCTATCCAGT GGGATGAGGA TGAATGTATC
TGGGAATTTC GCTTTGCTCC CAGATACGAA ACTGAAGAAA GTTTGCTGGC TAACGTCATC
GACGACCCGA AAGAATTACC CGCCGTCATT GAACCCGATG ACCCCGATTG GTTCCGTCGA
GCAATTCGTG TTTCCTTCAG CTGCTTAATT CACGGTGGGA TGGGTGCGGG TAAGTCTGTA
CTGGTAAGTA ACTTAATCTG CTGTGCCAAT CAGGAGTTAG AACAGGTTTA CAGTCTTGCA
CCAGAACTGG TAATCATTGA CCCGAAATTC CCAGACAGTG AATGGATTAT CGCCGGCAAG
CGAATCAAAC CAAACTATCG GGGGTGGGAA AATTCCATTA CTGGCGTAGC CGACATGGGA
CAAGAAGTTG ACCGCCGACT TGATGAAGCC AAAGCGATCG CAGAAGAAAT CGACGAAGAA
CTATTTGCAG ATCCTAATTA CGTTCTACCC CTGCCAGAAC GTACACCGAC TATCTGGGTG
ATTGATGAGG CCGCAGCGCT GCATGAGCGA TATGGTAATG AATTTTCCGA ACCACTCAAG
AATGCTTTTT GGGTCGGTCG TTCAACTCGT AATGTGGCGA TCGCAATTGG CCAAAATCCC
AACTGCTCTA ACTATGGTTT ACAGCGACCA GACGCAGAAA ATGCCAGCCG CTTTTTCTTG
GGTGCAGCAA TGGCCTTGAA GGGACTGGAT GAACTCAAAA CCACCAGGGA ACTGAAAACC
AAACTCAGAC AACAAATATA TGCGCGGTTG GCATTGGTCA GACAAAGGAA GTTGCAAGGT
ATCGACAAGC CGGAGGAACA ATACTTTGGC TTGGTGGCAA TTCAAAATGA GATGCCATTT
ATTGCTCAGA TGCCACCACC CAATGCTTTT GCTTTTGACG GCGAGGTGGA ACTGAGGGAA
GGGGAGGAAA TTGAAACCAT TGAAAACCTT ATTGAACAAG CCGACTGGGA AGAAACGCTA
CGGGCCCTAC CTGAAGAACT GCGAATCGTG GCTGAATACG CCAAACGCAA GCGGGGGGAT
TGGGTAACGG CGCGGGACAT TAAGCGCGAT CGCAACCGTG GGAAAATTTC CACCGCATCA
ACTGAAGAGG TGCGGCAATG GTTCACCCGG TTAGCAGCTA TGGGTATAGG GGAGGTGACG
GGTGGTGGTA CAGCGCTCAA ATACAAACTG GGATGA
 
Protein sequence
MAQVRRDTNR GTGVWSGVVQ IAAAHTLIIV GLYEPSVLLP TTAIGFFISS WVLGKFPNPL 
PRKNELIAAA HRENWRTSAL TEVEKSAQSR LEEIEQKAGV LVEMRSQLQT KSTQLAQWEQ
NLLQAEGKFR ELLQQTEDHY KELLQQQAEQ YMAEIRLREG TIVGLQQKIM RMANRPDPKQ
GFAAWVAQML LDALEQNQVY CRMVAFQKVP GVREVSVWVE LEPIAVEPGY ARKLEGLSKE
VASLVKLGEP AIQWDEDECI WEFRFAPRYE TEESLLANVI DDPKELPAVI EPDDPDWFRR
AIRVSFSCLI HGGMGAGKSV LVSNLICCAN QELEQVYSLA PELVIIDPKF PDSEWIIAGK
RIKPNYRGWE NSITGVADMG QEVDRRLDEA KAIAEEIDEE LFADPNYVLP LPERTPTIWV
IDEAAALHER YGNEFSEPLK NAFWVGRSTR NVAIAIGQNP NCSNYGLQRP DAENASRFFL
GAAMALKGLD ELKTTRELKT KLRQQIYARL ALVRQRKLQG IDKPEEQYFG LVAIQNEMPF
IAQMPPPNAF AFDGEVELRE GEEIETIENL IEQADWEETL RALPEELRIV AEYAKRKRGD
WVTARDIKRD RNRGKISTAS TEEVRQWFTR LAAMGIGEVT GGGTALKYKL G