Gene Ava_1105 details

Gene Information

Locus tag: Ava_1105
Symbol:
ID: 3678520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1346818
End bp: 1349016
Gene Length: 2199 bp
Protein Length: 732 aa
Translation table: 11
GC content: 47%
IMG OID: 637716441
Product: TPR repeat-containing protein
Protein accession: YP_321624
Protein GI: 75907328
COG category: [R] General function prediction only
COG ID: [COG4248] Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains
TIGRFAM ID:
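
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1346818
end_bp = 1349016
gene_length_bp = 2199
protein_length_aa = 732

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(gene_length_bp % 3 == 0)                       # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1349016 - 1346818 + 1 = 2199 bp, and 2199 / 3 - 1 = 732 aa.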


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGTAC TACGTTATCT TCCTCAGGAA GAGATTATCA GCCTCAGCGT GAGTTTGGGG 
CGTGGCGGTG AAGCGTGCAT TTATGCTGTA CCGTCGGCGG GTGATTGTGT GGCAAAGATT
TATCACAAGC CGACAGTTGC CCACGCCAGC AAACTCCGGG CGATGCTGGC TAACCCGCCG
GAAAATCCTA CGGCTAGTTT GGGTCATATT TCCATTGCTT GGCCGCAAGA ATTATTATGG
GGAGCAGATG AAAGCGAACG CGTCATTGGC TTTTTGATGC CGCGTATTCG GGGGATGCGT
CCTATCATCG ACTTTTACAA CCCCCGGACT CGCCGTCAAC ACTGTCCTTT ATTTAATTAT
CAGTACCTAC TGCGGACAGC GCGGAATTTG GCGGCGGCTT TTGCGGCTTT ACATAATAGC
GGCTATTCTG TAGGCGATGT CAACGAATCG AACATTTTGG TAAGTGACAC AGCCCTGGTT
ACCTTGGTAG ATACGGATTC TTTCCAAGTA TGTGACCCTG ATAATGATCT TGTTTATCGT
TGCCCGGTAG GGAAACCAGA GTTTACCCCA CCAGAACTAC AGAATAAAAT CTTTGCTCAT
CACGATCGCC AAGCTACTCA TGATTTATTT GGTTTAGGGG TGCTAATCTT CCAACTGCTG
ATGGAAGGTA CGCACCCCTT TTCTGGCATT TATCAAGGCA TTCCTGAACC ACCACCTTAT
GAAGCCAGAA TTGCGTCGGG ACATTTCACT TATAGTAAGA AACGGCAAGT ACCTTACCTA
CCTACTCCTA TCGCCCCGCC TTGGGAAATT CTCCACCCCA GCTTACAAGC GCTGTTTATT
CGTTGTTTTG AGGATGGTCA CAACGAACCC CAACTGCGCC CTAATGCCCA AGCTTGGCTA
TCGGCTATAG CTGAGGCTGA AGATTCCCTG ACTACCTGTA CAGTTAATTC TCAACATCAC
TACAGCAATC ACCTGCACAG TTGCCCTTGG TGCGAACGTG CTTTACGCTT GGGTGGTCGT
GACCCGTTTC CTTCGGTGCA AGCGATTGAA AATAGAGAAC ATCTCCGTCC CCGCATCCCC
ACCAAGAGAC GCTACGGCCA CGGCAATCAG CCTGTTAACT TGCCGCAGCC AGTGATGCCG
ATGTACCAAA GCAACTGGCA CTCACCCACC CCCAGTTTTT CACCTTACCG CAATCGCTGG
AAAGGGAAGT TTTATCCGGT AGTTTTTTGT TTGTTGGGTT TTGGGGTTTT GGGATACTTG
GATGTGGTGA CAAAGTTCAC CAGTCCTTTG GTATCGCGCA ATAATTATGC TCAACAAGCG
CTGATGCCTA AACAAGCTAA TAGTAATACC GCCCTGAGTT TTGCAGAGTA TTATCAACAG
GGTCATGCTG CTTACCAAGT GCGTGATTAT AAACAGGCAG TAGATAACTT CACCCATGCC
ATTCAGCAGG AACCAACAAA CGCCAAAGCT TTGGTGAACC GAGGTAATGC GCGTTATAAC
CTGAAAGACT ATGAAGGTGC TTTAGCCGAT TACACTGTAG CTTTGCAAAT TAATCCCAAT
GAAATCAAAG CTTTTGTGAA TCGGGGTAAC TCCCGCTTGA TGCTGGCTGA ATATAGTAAT
GATCCTGACC AACAGTATAG ATTAGCGATC GCCGACTTTA ATCACGCCCT GAAACTCAAC
GAGAAGGAAG CCGAAGCTTA TATCCGCCGG GGAATTGTCC GGTCACAAAT GGCTAAATAT
AGCAGCGATA CCATTAAAGA TTACCAAGAA GCGATCGCCG ATTTTGACCA AGCCCTGAAA
CTTAACCCCG CTAAAACCGA AGCTTACTTT CAGCGTGCTT CTGTCCGCTA TCTCATTGCC
CAATATACAG GTGATTCGAC CAAGGAATAT GATCAGGCGA TCGCAGATTT TGATCAAGCA
TTAAAAATTA ACGACAAACT AGCCAAAGTG TATCTCAAAC GTGGTATGGT GCGCTACGAA
TTAGCACAAA TTACTAGTAA TAAATCTGAT GCCAATAATG CCAAAGCTCT TGCAGATTTA
CAGCTAGCCG CCAAACTTTC TTTAGAACAA GAAGATACGG AAAGTTATCA ACAAGCACTC
AGCAGTATCT GTATCATTGA GGAAAGCAAA TGTAATGCTT TATTCCAAAG CTCTACAATG
CGAGGATATG CCAGCACCGA CTTGACAGCA AAACAGTAA
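
The gene sequence above is displayed in space-separated blocks of ten bases, so the blocks must be joined before the sequence is used. The following is a minimal sketch of one way to do that and to compute a GC percentage. The helper names `join_blocks` and `gc_percent` are hypothetical and are not part of any database tooling.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already-joined sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied as shown.
first_line = "ATGAAGGTAC TACGTTATCT TCCTCAGGAA GAGATTATCA GCCTCAGCGT GAGTTTGGGG "
seq = join_blocks(first_line)

print(len(seq))   # 60
print(seq[:3])    # ATG
```

Applied to the full gene sequence, the joined length should match the Gene Length field, and `gc_percent` should come out close to the reported GC content, although the rounding used in the record is not stated.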
 
Protein sequence
MKVLRYLPQE EIISLSVSLG RGGEACIYAV PSAGDCVAKI YHKPTVAHAS KLRAMLANPP 
ENPTASLGHI SIAWPQELLW GADESERVIG FLMPRIRGMR PIIDFYNPRT RRQHCPLFNY
QYLLRTARNL AAAFAALHNS GYSVGDVNES NILVSDTALV TLVDTDSFQV CDPDNDLVYR
CPVGKPEFTP PELQNKIFAH HDRQATHDLF GLGVLIFQLL MEGTHPFSGI YQGIPEPPPY
EARIASGHFT YSKKRQVPYL PTPIAPPWEI LHPSLQALFI RCFEDGHNEP QLRPNAQAWL
SAIAEAEDSL TTCTVNSQHH YSNHLHSCPW CERALRLGGR DPFPSVQAIE NREHLRPRIP
TKRRYGHGNQ PVNLPQPVMP MYQSNWHSPT PSFSPYRNRW KGKFYPVVFC LLGFGVLGYL
DVVTKFTSPL VSRNNYAQQA LMPKQANSNT ALSFAEYYQQ GHAAYQVRDY KQAVDNFTHA
IQQEPTNAKA LVNRGNARYN LKDYEGALAD YTVALQINPN EIKAFVNRGN SRLMLAEYSN
DPDQQYRLAI ADFNHALKLN EKEAEAYIRR GIVRSQMAKY SSDTIKDYQE AIADFDQALK
LNPAKTEAYF QRASVRYLIA QYTGDSTKEY DQAIADFDQA LKINDKLAKV YLKRGMVRYE
LAQITSNKSD ANNAKALADL QLAAKLSLEQ EDTESYQQAL SSICIIEESK CNALFQSSTM
RGYASTDLTA KQ