Gene Ava_4097 details

Gene Information

Locus tag: Ava_4097
Symbol:
ID: 3681562
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5089916
End bp: 5091094
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 44%
IMG OID: 637719445
Product: hypothetical protein
Protein accession: YP_324593
Protein GI: 75910297
COG category:
COG ID:
TIGRFAM ID:
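
The length fields above are consistent with the coordinates. The following minimal sketch shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(5089916, 5091094)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1179 bp and 392 aa, matching the Gene Length and Protein Length fields.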


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.407724
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.826772
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTAGCT TAGATGCACT GGCTCTCAAA AACTCTACAC ACTTATGGCT AGAAATTTCG 
GAAACTGAGC AAAAAAAAAT TTGGCAGCAA AGCCAAGCTT TTTCCTCAGA TAGTCGGCGG
TGGAGTGCCT ATCTCAATCG CTTGAGCCTG AACACTTTTC TACCTTGGCT ACAAGCCGAA
CACAATCCCG ATGCGACTCC TTTTCCCCGA CTGGCTGCGC TACCCAGTGT TTGGGAAGTA
GTCAATGGTT TTGGGATTTC TTTCGGGACA AAACGTATGG TATTAATTCC TACGGAAGCA
GTTGATTTAA GCGAACTGCG CGTTCCACAA GAATGGGTAG ATATTCCTAG TTGGACAGCA
GACTATTACC TAGCCGTACA AGTAAATCTC GAAGAAGGCT GCATCCGGAT TTGGGGATAC
ACCACCCATG CGCTACTGAA GAGTATGGGT ACTTATGACG CAAGCGATCG CGCCTACTGT
GTGGATGGAG AGAATTTGAT TGCTAATCTT GGTATTTTGT GGATAGCAAG TCAACTTTGT
GTGGAAGAAC CCACACGTTC TGAAGAAACA GTACCTTTAC AAGAGTTAGC GATCGCGCAA
GCACAAAATT TACTGCAACG TCTCGGAAAT CCGGCTCTAA TTTTACCCCG TCTAGCAGTA
CCCTTTCCCA CTTGGGGTGC GCTCCTACAA CATGGCGGTT GGCGGGAACA TTTGTATGAA
CAGCGTCAAG GACAACAACA AAAGTGGTCA ATTACCCAAT GGTTGCAAGC TGGTGTATCG
GATTTTGCAC AAGCATTCGG CTGGCATAGT GTCGAGCTAG AGTCAAATTT TGCAGGAGCT
AGAGGTTTAG AATCAAAAAC TGAATTACCA ACTTTAGTGC GAACACTCAC TATTGCTGGG
CAAGAATATG AATTGCGAGT CAAGGCAAAA AATAGTATTA CAGATAGAGT CTGGCGATTT
GAATTACAAA ATGCAATGAG AGGTGAAATG ATTGCCCAGG GAATAAAATT ACGACTGTTG
ACAGAAAATT TACAACCATT TTATGGCAAT CAAGTACAGG CTAATACTCC AGTAAATAAG
CTATACCTGG AAGTGGCACT GGGTGATACT GAGGAAGGAT TGGTATGGGA AATAGAGCCA
ACTCCTGAAG ATTTTGAACA CGAAATTTTG TTTTTTTGA
 
Protein sequence
MVSLDALALK NSTHLWLEIS ETEQKKIWQQ SQAFSSDSRR WSAYLNRLSL NTFLPWLQAE 
HNPDATPFPR LAALPSVWEV VNGFGISFGT KRMVLIPTEA VDLSELRVPQ EWVDIPSWTA
DYYLAVQVNL EEGCIRIWGY TTHALLKSMG TYDASDRAYC VDGENLIANL GILWIASQLC
VEEPTRSEET VPLQELAIAQ AQNLLQRLGN PALILPRLAV PFPTWGALLQ HGGWREHLYE
QRQGQQQKWS ITQWLQAGVS DFAQAFGWHS VELESNFAGA RGLESKTELP TLVRTLTIAG
QEYELRVKAK NSITDRVWRF ELQNAMRGEM IAQGIKLRLL TENLQPFYGN QVQANTPVNK
LYLEVALGDT EEGLVWEIEP TPEDFEHEIL FF
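
The relationship between the two sequences above can be checked with a short script. This is a minimal sketch, not the tool that produced this record: it strips the 10-character grouping, computes the GC fraction, and translates codons using the amino-acid assignments of NCBI translation table 11 (which are the same as the standard code; the tables differ only in permitted start codons). The helper names and the `FIRST_ROW` and `LAST_ROW` constants are hypothetical, and only two rows of the gene sequence are used as examples.

```python
from itertools import product

# Codon order follows the NCBI listing: bases T, C, A, G, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace grouping and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence listed above.
FIRST_ROW = "ATGGTTAGCT TAGATGCACT GGCTCTCAAA AACTCTACAC ACTTATGGCT AGAAATTTCG"
LAST_ROW = "ACTCCTGAAG ATTTTGAACA CGAAATTTTG TTTTTTTGA"

print(translate(FIRST_ROW))  # first 20 residues
print(translate(LAST_ROW))   # final residues before the TGA stop
print(round(gc_fraction(FIRST_ROW), 2))
```

The first row translates to MVSLDALALKNSTHLWLEIS and the last row to TPEDFEHEILFF, which agree with the start and end of the protein sequence above. Passing the full 1179 bp gene sequence to the same functions is the way to check the complete protein and the reported 44% GC content.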