Gene Ava_C0237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_C0237 
Symbol 
ID3678036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007412 
Strand
Start bp289206 
End bp292244 
Gene Length3039 bp 
Protein Length1012 aa 
Translation table11 
GC content38% 
IMG OID637715317 
Producthypothetical protein 
Protein accessionYP_320511 
Protein GI75812894 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000237421 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000767054 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAATATTA TTTCTGAAGG TGACAATCAT TTAGCTCCCT TAGATTACAG CAGTTTACTC 
AACAAAATCT TGGAAGTGCT GCCCAGTCAA AACCCATTCC AGTTCTCCCC AGATGACAGT
CGATTGCGGA TTAATATTGA CAAAATCGCC GATCTGGTTG CTAATTCACA AGTGGAAAGC
CCTTTGGCAT CTGCTAGCGG GATTCGTTGT GCAACTATCA ACCTTAGCGA TCGCACAAGT
AAAAATTTCC CAGAGCAAAT CAGGCTAATC CGCGATCGTT TGCAAAACCT CTTGATTGCC
GCATTACCAC CAAACAAAAC AGTAGAATCA GCGATTTCGC AATTACTGAC AGATTTACAG
TCTTTTCAAG GTACAAAGCC ATCTCTGGGA TTTACTTATC CTTTCGGAGA GTATACAGGT
TTGCAAAAAC AGCGGCTCAG TCTTCAGCGC CACCGTTCAG GCAGTGATTC TCTCTTGAAA
CTTCATAAAC TAATCATCAC AGTTAAAGAT GTCAAGGGAT TTGATTCTCA ACTTTCTTCC
AGTCTTGATA ATTATATCCA ACAAGAATTT AGCGAAGAAA CTGAGAGTGA TTTAGAGGAA
TTAAGCGACA CACTAGAAGA ACTGATTAAA AATCAAAAAT CTGACTTTTA CAAACTCAAA
CGAGTTATGT ATACCGAAGC TATTGGTATG CTCAAACGAG AGGCGAAAAT CAGGTATTTA
GAGTTTATTT TGGAGAATGT TAACGATAGT GTAGATGGCA AAATATATCT TCAAGATTTA
ATTCGACGGC TACGATTACT AGAAGAATAT ATCAATGATA TCAACCGAGG GTATGGTCAT
TACCAAGTAA ACTATGCCGG AAAGTCAGTC AACTACAGAG ACTTATTTTC TCGTGCCGAA
GCTTTAGATA TTCTCCCCAT TATTCCCTTA ATAGAAGGGT ATTTAGGAGA AACAAAAGAT
GATACTAAAG GAGAGCAGCA GTTTATTTTT GGTTTCAAAC TAAAATTTGG CGGTACAGTT
CAAGCTTACG GAGACAAGCC ACTTTCTGTA CTAGATTATT ACCTCAACCT AATCGACCCA
GAAAGTCAGG AACATAAGGC GGGATTAGGG GACTCTTTTA AAAGTGATTA TTTCAAAGAA
AAAGTATTAA AAATAATATT TTTATACTAT TTTATATTTG CTTCTAGGAG TAATCCTTTA
GCAGATAACT ACCAAGAAAA TTCAGAGCTA AATTATGAAC CTGTTTCGGT CTTTGAGCAA
CAGGTATTAC CTGTACTTAA AGGTTCTGAC GAAGAAGCTA AAAAGAAAAT ATTTGCCAAA
ATTAAGGAAG GAATTGAAAA ATTTAACGTC AAAATAAAAA TTGAAAAACT CAGGCAGTTA
CTTAAGGATT TGCTCAAAGA TAAGCCTATT CTAAAAAGAC GAGAGTATCC GCTTTATCTC
GGCGTTAAAA GAGGCATTTT AGAGCGCGAT TTAGACCGCA TTTTAACTGA TGCTACATTT
TTCAAGCCTG TCTTTGGTGA GAAGTCACGA GAAGCCCTAA AGTATATTGC CATCAGGGAG
CCTCATGTTG ATAGCAGCTA CCTTTGCAAT CTTTCTGTCA GTATTGCAGT AGAAGATATT
CGCTATTTCC CCACTGATGA CCGCCAAATA TTTAGCATGG CGTATGATAT CACAAATATT
AAAGCACTAC CAATTCTTTT GGTTCCTTTT ACTGAAGAAA ACTGTCAGAA CATCTACAAT
AATTCTTTCA AAGATCAAAA ACCAATTTTA TTCATCTACA ATCATAAAAA GTTACGAGAG
CAAATTTTCA ACAATCCTGA CTCCCCCAAA GCGTTTATTT ACCGATTTAC TTTTTCTCTG
CTTATTTATA TCTGCCTGAA AGTACTGCTT GAGTCTGTCA AAGATAAACT GTTTATTCCC
ATTGTGCGGT TACAACTGAA TATTAAGCAG AACTCCACAC CAGAAGAAGA ATTTATGCGC
TCGCTCTCTA AAATTTTATC CCACTTATTG AGCGAAGAAC ATCAATCCAG TACTCAAGGA
TTCTGTGTCA AAAACCTGAA TCCTTACAAA ATTGAAAATG GTCTAAGTTC TCTTTACTCT
GTACTGCCCA AAAAATTTAA ATTCAACGCT TCTTCTTACA ACACCCAGCT AGACAAACTA
GCAATTATTA TAGTCTCTAG CCGTGAGAGT GATGCCAGTT GGCGAGGTGG AGAACGAATA
TCAAACTTGT TAGGAGAAGT TGTAGGAGTA GAGCGTCTAC AGGATGGCAC AGTTCGCTTA
GAAAGGCTCA ACACCTTCGC AGCAAACTAT GAACATCAGC AGATGTTTCA AAGACCTACT
ATTGTCATTG ATGAAGTGGA CAAATTGTAC CGCCAGGGCT ATCGGCACTT TATGTATATT
GCCAAGTCTC CATATTCTAG CACTCTCAAT ATGACTCAGA GTGATGAAGA TGAATCACTA
TTTTTCATGT CTACACCTGT ACTCAAGGCT TTAAAGGGAG ATAGAGACGA CATCAAAATA
TATCCAGTAT TTTTTGATAA ATATTATGTG GTAAATCTAG GGCAGACTGT TAGTTCTCTG
TACATACAAG ACACGCTTGA ACTAACAAGC TTAGTTGAAG ATCCCAGCAA GAAAGCTGTT
GTCTTTTTCA ATCTCTTCAA TGGAATTAAA GTCGGTCAAG ATGACGAGCG TTACTACAAC
GGGGTAATTT CCTACGCCAC CTTGCTAAAT GTCTATGATG ACATTCTAGA TGATGAAGAT
ATCCGTAGGG GACTAATTTA TGACCGCGAA AACAATCCTA TCAAAAATGA TATCTTGCAA
TATCTAACCT TATTCCACTT TTCGCGTTAC GAAGCTGTTC CCAGAAAGAA TCGGAACATT
AGCTTTAAAC TTGACCCTTA CGAAAACATA ATTGGGGATG ATGCGGTTGG TAAATTGTCA
GTTTTCAAGC ACACCACTAG GAGTGTAAGC TTTAACTCTT TAGCCTTTTT AACTGAAGTT
AGACGTGCTT TGAATGTAGA AGGAGAGGAG CGGAAATGA
 
Protein sequence
MNIISEGDNH LAPLDYSSLL NKILEVLPSQ NPFQFSPDDS RLRINIDKIA DLVANSQVES 
PLASASGIRC ATINLSDRTS KNFPEQIRLI RDRLQNLLIA ALPPNKTVES AISQLLTDLQ
SFQGTKPSLG FTYPFGEYTG LQKQRLSLQR HRSGSDSLLK LHKLIITVKD VKGFDSQLSS
SLDNYIQQEF SEETESDLEE LSDTLEELIK NQKSDFYKLK RVMYTEAIGM LKREAKIRYL
EFILENVNDS VDGKIYLQDL IRRLRLLEEY INDINRGYGH YQVNYAGKSV NYRDLFSRAE
ALDILPIIPL IEGYLGETKD DTKGEQQFIF GFKLKFGGTV QAYGDKPLSV LDYYLNLIDP
ESQEHKAGLG DSFKSDYFKE KVLKIIFLYY FIFASRSNPL ADNYQENSEL NYEPVSVFEQ
QVLPVLKGSD EEAKKKIFAK IKEGIEKFNV KIKIEKLRQL LKDLLKDKPI LKRREYPLYL
GVKRGILERD LDRILTDATF FKPVFGEKSR EALKYIAIRE PHVDSSYLCN LSVSIAVEDI
RYFPTDDRQI FSMAYDITNI KALPILLVPF TEENCQNIYN NSFKDQKPIL FIYNHKKLRE
QIFNNPDSPK AFIYRFTFSL LIYICLKVLL ESVKDKLFIP IVRLQLNIKQ NSTPEEEFMR
SLSKILSHLL SEEHQSSTQG FCVKNLNPYK IENGLSSLYS VLPKKFKFNA SSYNTQLDKL
AIIIVSSRES DASWRGGERI SNLLGEVVGV ERLQDGTVRL ERLNTFAANY EHQQMFQRPT
IVIDEVDKLY RQGYRHFMYI AKSPYSSTLN MTQSDEDESL FFMSTPVLKA LKGDRDDIKI
YPVFFDKYYV VNLGQTVSSL YIQDTLELTS LVEDPSKKAV VFFNLFNGIK VGQDDERYYN
GVISYATLLN VYDDILDDED IRRGLIYDRE NNPIKNDILQ YLTLFHFSRY EAVPRKNRNI
SFKLDPYENI IGDDAVGKLS VFKHTTRSVS FNSLAFLTEV RRALNVEGEE RK