Gene Ava_5054 details


Gene Information

Locus tag: Ava_5054
Symbol:
ID: 3683536
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 6346168
End bp: 6348018
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 43%
IMG OID: 637720415
Product: Serine/threonine protein kinase
Protein accession: YP_325546
Protein GI: 75911250
COG category: [K] Transcription
              [L] Replication, recombination and repair
              [R] General function prediction only
              [S] Function unknown
              [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
        [COG1262] Uncharacterized conserved protein
TIGRFAM ID:
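
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(6346168, 6348018)
print(length_bp)                  # 1851
print(protein_length(length_bp))  # 616
```

Both values agree with the Gene Length and Protein Length fields of this record.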


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.688753
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAATCT GCCAAAATCC CAATTGTTCA AACCCATTCA ATCCTGATGG CAATAGATTT 
TGCATGAGTT GCGGACAAAG CAACTTTGGC AAACTACTAA GAAATCGCTA CCGCGTACTA
GGACTATTAG GTGAAGGTGG ATTCAGCAAA ACCTACGCAG CTGAAGACGC AGATAGACTA
GATGCGCCTT GTGTCATCAA ACAATTCTTT CCACAAATTC AAGGAACTGG ACAGCGCAGC
AAAGCGGCAG AATTCTTTAA AGAAGAAGCC TTCCGTCTGT ATGAACTGGG AGAAAATCAC
ACTCAAATTC CGAGATTATT AGCTTACTTT GAACAAGGTT CTAGCTTATA TCTTGTTCAA
GAGTTTATTA AAGGACTAAC ACTTTTACAA GAAGTTCAGC AAGAACCCTT TAATGAAGAA
AAAATTCGGC AACTGTTAAT TGATTTATTA CCAGTTCTCG ATTTTATTCA TTTTCATCAA
GTTATCCACC GAGATATTAA ACCAGAAAAT ATTATTCGCC GTGATGGTGA TGGCAAATTA
GTATTAATTG ATTTTGGTGG TGCTAAACAA GTTACCCAAA CTAGTATAGC TAGACAAGCA
ACAGCTATTT ATACCATTGG TTATGCACCA ACGGAACAAA TGGCAGGGTT TGCTTGTCAT
GCTAGTGATT TATATGCTTT GGGTGTTACT TGTGTTAGAT TATTAACCCA ATGCCTACCG
CAACAAAATC CTTACGGACA TATCGATGAT GGTCTTTATG ACCCCATGAA TGGTAAATGG
TTATGGCAAG AATATCTACA GGACAGAGGT ATCAAGATTA GTGAGAATTT AAGTCAAATT
TTAGATAAAT TACTCAAGCA TTTACCTAGC GAAAGATATC AATCAGCTGC CGAAGTCCTC
CATGATTTAC AAGTATCCAC AGCCATAGTT GAAGAAACTC AAATTCTGCC AAATACTCAA
ACAACATTAG TCCCATTAGC ACCAGCAACA CAAAGAACAA AACGGCCATT ACCTCCCCTA
CAAACCTTTG AGTATGAGGT AGTTACAGTA GATACAGCTG GCCGCATCGT CAACCGCGAT
CGCACCAATA CCCAAATTTT GGTAGAACAA TTAAACAAAG ACATTACCCT AGAAATGGTG
TCAATTCCTG GCGGTGCATA CCTGATGGGT TCGCCCAACT TTGAAGGCGA TGCCGACGAA
CGCCCCCAAC ACCAAGTAGC GATCGCCCCC TTTTTCATGG GAAAATATCC CGTAACTCAA
GCACAGTGGC GAGCTGTGGC TGGCTTACCC AAAATTAAAC AAGCCTTAAA TCCCTACCCC
TCAAAATACA AAGGTCAAAA TCGACCAGTA GAAAACGTCT CTTGGCACGA AGTTTTAGAA
TTTTGTGCTA GACTCTCCGA AAAAACCGGA CGGGAATATC GCCTACCCAG CGAAGCCGAA
TGGGAATATG CTTGTCGTGC TGGAACAACT ACATCTTTTC ACTTTGGCGA AACCATCACC
CCCGATTTAG CCAACTTTAG CGACGGCGAT ATTCACAATC TTGAAGCCAA AACCAGATAC
CGCAAAGAAA CTACAGATGT CGGCAACTTT CGCGTAGCCA ATGCCTTTGG ATTGTACGAT
ATGCACGGAC TAGTCTGGGA ATGGTGTGCC GACCCTTGGC ACAATAACTA CAACGGCGCA
CCAACTGACG GTAGTGTGTG GGAAGCAGGT GGTGATATAT ATCGCCGAGT GTTGCGTGGC
GGTTCTTGGA ACTTTGCCGC AGAACTGTGT CGCAGCGCCA GCCGCAGTTG GAATGAGTCA
GACGGTGGCT TAAGGATATG CGGTTTTCGA GTTGTATTTT CCCCAGGTTA G
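
The gene sequence above is displayed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one possible way to do that and to compute a GC fraction comparable to the GC content field; the helper names are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGCAAATCT GCCAAAATCC CAATTGTTCA AACCCATTCA ATCCTGATGG CAATAGATTT"
seq = clean_sequence(first_line)
print(len(seq))                     # 60 bases per displayed line
print(round(gc_fraction(seq), 3))   # GC fraction of this line only
```

Running `gc_fraction` over the full joined sequence, rather than a single line, is what would be compared against the reported GC content.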
 
Protein sequence
MQICQNPNCS NPFNPDGNRF CMSCGQSNFG KLLRNRYRVL GLLGEGGFSK TYAAEDADRL 
DAPCVIKQFF PQIQGTGQRS KAAEFFKEEA FRLYELGENH TQIPRLLAYF EQGSSLYLVQ
EFIKGLTLLQ EVQQEPFNEE KIRQLLIDLL PVLDFIHFHQ VIHRDIKPEN IIRRDGDGKL
VLIDFGGAKQ VTQTSIARQA TAIYTIGYAP TEQMAGFACH ASDLYALGVT CVRLLTQCLP
QQNPYGHIDD GLYDPMNGKW LWQEYLQDRG IKISENLSQI LDKLLKHLPS ERYQSAAEVL
HDLQVSTAIV EETQILPNTQ TTLVPLAPAT QRTKRPLPPL QTFEYEVVTV DTAGRIVNRD
RTNTQILVEQ LNKDITLEMV SIPGGAYLMG SPNFEGDADE RPQHQVAIAP FFMGKYPVTQ
AQWRAVAGLP KIKQALNPYP SKYKGQNRPV ENVSWHEVLE FCARLSEKTG REYRLPSEAE
WEYACRAGTT TSFHFGETIT PDLANFSDGD IHNLEAKTRY RKETTDVGNF RVANAFGLYD
MHGLVWEWCA DPWHNNYNGA PTDGSVWEAG GDIYRRVLRG GSWNFAAELC RSASRSWNES
DGGLRICGFR VVFSPG
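
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, assuming the sense strand is given 5' to 3' as displayed. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in permitted start codons, which this sketch ignores). The `translate` function is a hypothetical helper, not a library call.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; same for table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon reached
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGCAAATCT GCCAAAATCC CAATTGTTCA"))  # MQICQNPNCS
print(translate("TCCCCAGGTT AG"))                      # SPG
```

The two outputs match the first ten and last three residues of the protein sequence listed above.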