Gene Ava_4106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4106 
Symbol 
ID3681494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5109651 
End bp5111141 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content35% 
IMG OID637719454 
ProductSerine/threonine protein kinase 
Protein accessionYP_324602 
Protein GI75910306 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.998471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.522407 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAACG TATTAGAGAT GATTACCGAA ATCGAGCCAG GAACTTTAAT ATACGGTCGT 
TACCAGATTC AGAAGTTGCT TGGTAAGGGA GGATTTGGAC GAACTTATCT AGCATTAGAT
AATCAGCGAT TTGATGAACC CTGTGTGTTG AAAGAGTTTG TCCCCACAGC ATCACAAGAA
AAAAATGTTT GTAAATCGAA AGAGTTGTTT GAGCGTGAAG CAAAGGTTTT ATATAAGCTT
AAGCATCCTC AAGTTCCTCA ATTTTTAGCT TGGTTTACGG ATAGCGATCG CACTTTTATT
GTGCAGGAAT ATATTGATGG TAGAACTTAT TCTGAAATTT TATTTGAGCG TGTCTCAGAA
ACAGGTCAGC CTTTTTCGGA AATAGAAGTG AGAACATGGT TGACAGATGT CTTACCAGTT
TTAGATTATC TCCACGATCG CAAGATCATT CACAGAGATA TTTCACTGGA GAACATCATG
CTACCTCATC ATCAATCGAA ACCTGTGCTA ATTGATTTTG GCGCAGTAAA AGAGAATGTA
ACTCAACTTA TGTCTCCTGA TTCCATAAAT TTTTATAACT CCATTCACAC TTCCGTCGTG
GGTAAGTATG GCTATTCTCC GCCTGAACAG TTGCGTTTAG GAATTTCCTA TCCTTCTAGT
GATATCTATG CACTTGGTGT TTGTGCAGTT GTACTATTAA CAGGAAAAAT GCCACATTTG
CTCTTAGACG AATCACTAAA TTGGCAATGG CGATCGCAAG TCAATATTGC TGATGATTTA
GCAGCAATTA TCGACAGAAT GCTGATAGAA TCACCCACTG CACGCTTCCA ATCAGCTAAA
GAAATTATTC TGAAGTTAAA TAAGCTCCAT AATAATTCCC CTACTGTAAC TCAGGTTGAA
TTCAAAATTA CATCTCCCAT AGAAGCAATT AAAACTCGTC AACAAGAAAA AGAGACCAAC
AAGGCATTAG AAGAGTTATT AATTTTACAA AATCTAGAAC GCACCCTGAG ACAATATCAT
GATAAATTGC CAAAACCTAT TTATCTCAAC TTAGATTTAC CAGAGTATAT GGAGAAACAC
ACAACTCCTG CTAGTGAATC TTCTGGTTGT GCTTTTAAAA AAACTTCTAA AATAGCTGCA
AAAATTATTA ATATTTTTAC TAGAAGAGTT AACAGACATA TAGTAAGAAA AAGTGAAGCT
ACTAACGTAA ATTATATTCA GATTAATACA CATGATTTTT TAGAAAAAAC ATCTATCAAT
CGTAACTCGC AGATTTTGGA AGTTATTAAA AAAGAATTTA CAAATTTTAT TGGCCCAATC
GCCAACTTAA TTATGAATAA GGTATTAGTA ACTTTTCCAG ATTGTTCTGC TAATCAACTT
ATAGAAATTT TAGCAGCATC AATTCCTGAC AAAATAACAG CAGAACGGTT TCAAAATGAT
ACACGTAAAC TCATTATATC AAACTTGTAT ATCGTAAAAA CTAATGAATA G
 
Protein sequence
MNNVLEMITE IEPGTLIYGR YQIQKLLGKG GFGRTYLALD NQRFDEPCVL KEFVPTASQE 
KNVCKSKELF EREAKVLYKL KHPQVPQFLA WFTDSDRTFI VQEYIDGRTY SEILFERVSE
TGQPFSEIEV RTWLTDVLPV LDYLHDRKII HRDISLENIM LPHHQSKPVL IDFGAVKENV
TQLMSPDSIN FYNSIHTSVV GKYGYSPPEQ LRLGISYPSS DIYALGVCAV VLLTGKMPHL
LLDESLNWQW RSQVNIADDL AAIIDRMLIE SPTARFQSAK EIILKLNKLH NNSPTVTQVE
FKITSPIEAI KTRQQEKETN KALEELLILQ NLERTLRQYH DKLPKPIYLN LDLPEYMEKH
TTPASESSGC AFKKTSKIAA KIINIFTRRV NRHIVRKSEA TNVNYIQINT HDFLEKTSIN
RNSQILEVIK KEFTNFIGPI ANLIMNKVLV TFPDCSANQL IEILAASIPD KITAERFQND
TRKLIISNLY IVKTNE