Gene Ava_3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3803 
Symbol 
ID3678773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4737473 
End bp4738717 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content46% 
IMG OID637719154 
ProductGCN5-related N-acetyltransferase 
Protein accessionYP_324303 
Protein GI75910007 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.013575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCTC AAATCAACAT GACTTCATTA CTTCCTCGAA ATCTTAGCGT TGTCATCCGG 
CCAGTCTATT ATCGGGACTT GGACGGAATT GAGCGAATAT CTCAAGAATC CTTCGCAGCT
CATACCCCCC AAGGAGCTAG TTCTATTGCT AATCGGATGC AATGGTTGCG TCGCTGGTAT
GGGTTACTCA AGTTTTTGAG TTGGTTCCCT AACCCGCTAC AATACCGCTT TTGCGCCTAT
GTAGCCGAAC AGGGGCGGAT GCTCTTAGGG ATGATTCAAG TTTCGCCGTT TAACCGGACA
CGCAGCACTT GGCGAGTTGA CCGGGTGATT TTGGATCGGG CTGTCGATAA GCAGGGAATT
GGTTCACAGC TACTACGCCA CTGTTTTGAA GGGATTTTAG AAGCTCGTAC TTGGTTGTTA
GAAGTTAATG TCAATGATAC AGATGCACTA GCGCTTTATC GGCAAAATGG ATTTCAGCGT
TTAGCAGAAA TGACATATTG GGAAATAGAT CCCGAATTAT TAAGTGAATT AGCGCAAGCA
GAGCCAGATT TACCCAATCT TTTACCAGTC AGCAATGCGG ATGCTCAGTT GTTGTATCAA
TTGGATACAG CATCGATGCC ACCGTTGGTA CGTCAAGTAT TCGATCGCAA TACCCGCGAT
TTTAAAACCA GTTTGTTCGG CGCTTTAAGA GATGCAGTCA AACAATGGGT GACAAAAATT
GAAGTTGTAA GCGGCTACGT GTTTGAACCA CAACGCAAAG CAGCAATAGG TTATTTCCAG
TTACAGCTAG ACCGCAAAGG TGAAACTCCC CACGTTGCCA CCTTGACAGT CCACCCTGCT
TATACTTGGC TATACCCAGA ATTATTATCT CAACTGGCGC GAATTGCCCA AGATTTTCCC
CAACAAGGTT TACAACTAGC CTCCTCTGAT TACCAACCAG AGCGAGAAGA ATATTTAGAA
CGCATTGGTG CCAAGCGCAT AGAACACACG CTGATCATGT CTCGTTCAGT CTGGCACAAA
CTGCGGGAGT CAAAATTTGT CTCTCTAGAA GGGATTCAAT GGACTGATGT TCTCCAAGGA
CTGCAACCTG CGCGCAAACC CATCCCTGGG GGAATGTCCT GGGTACACAC AAGACAGCAA
TCATCCCCAG ATATCCCAGT ACCCAGTTCA TCAGAACCAA TGGCCTTTGG GATTAAAGAT
GTACCCAATC AGCCAGATTC AGAAGAAGGG GAGATTGGGG AGTAG
 
Protein sequence
MAAQINMTSL LPRNLSVVIR PVYYRDLDGI ERISQESFAA HTPQGASSIA NRMQWLRRWY 
GLLKFLSWFP NPLQYRFCAY VAEQGRMLLG MIQVSPFNRT RSTWRVDRVI LDRAVDKQGI
GSQLLRHCFE GILEARTWLL EVNVNDTDAL ALYRQNGFQR LAEMTYWEID PELLSELAQA
EPDLPNLLPV SNADAQLLYQ LDTASMPPLV RQVFDRNTRD FKTSLFGALR DAVKQWVTKI
EVVSGYVFEP QRKAAIGYFQ LQLDRKGETP HVATLTVHPA YTWLYPELLS QLARIAQDFP
QQGLQLASSD YQPEREEYLE RIGAKRIEHT LIMSRSVWHK LRESKFVSLE GIQWTDVLQG
LQPARKPIPG GMSWVHTRQQ SSPDIPVPSS SEPMAFGIKD VPNQPDSEEG EIGE