Gene Ava_3303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3303 
Symbol 
ID3680295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4123726 
End bp4125465 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content44% 
IMG OID637718654 
Producthypothetical protein 
Protein accessionYP_323806 
Protein GI75909510 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.1343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCAG TAGACTTTAC CACCCTCACA GCTACTTGTA GCGAACTCCG CGCTCACTGG 
CTACCATCCC GCCTAGAGCA AGTTTATCAG CGCGATCGCT ACACTATTGC TATAGCATTA
CGTACCCTCG ATAAAAGAGG TTGGTTACAA ATTTCTTGGC ATCCTCAAGC AACTCATATT
TGTATTGGTG ATCCACCTCC ACGCACACCA GATACCTTTA CCTTCAGCCA ACAACTAGTC
CACCAGTTGG GGGGATTAGC CTTAGTTGCA ATTGAAGCGA TCGCCCCTTG GGAGCGTGTA
ATTGATTTAC AATTTGCCCG TCGCCCTGGA GATGCTGCAC TGTATCACAT CTATGGGGAA
ATCATGGGTA AATACAGTAA CGTGATTCTC ACCGATGCCA ACAATCTAAT TATTACTGCT
GCCCATCAAG TGAGTCAGCA ACAATCAAGT GTGCGTCCCA TCCTCACCGG ACAACCTTAT
GAAACACCGC CAAAACTCAC CGGGACTATC CCCAGTTTGC AGGAAACTCA AGCACGTTGG
CAAGAAAGAG TCAGTTTAGT GCCAGGAGCA ATTAAACGTC AGTTGCTCAA AAGTTATAGT
GGCTTGAGTG CTGTGTTGGT AGAATCCATG TTATTGGTAG CCAACATTGC ACCAGAAACT
TCTACTGATT CCCTAACTCC TGAAGACTGG CAACGATTAT TTGCACGCTG GCAAGAATGG
CTACACACCT TAAATAGTGG TAAATTTCAA CCAGCTTGGA TGGCAGATGG ATATACAGTT
ATGGGTTGGG GTGCTGTTGC ACCAGTCAAA GATATCCAAA CATTAATCAA CCAATACTAT
ACCAAGCAAC TAAATCAACA ATTATTTGCC CAATTACGCC ATCAACTGAA TCAGAAATTA
AGTAATATTT TAGGCAAATT ACGCAATAAA GCCCAAACCT TTAGCGATCG CCTACAGCAA
TCAGATCGTG CTGATGAATA TCGCCAAAAA GCTGATTTAT TAATGGCGCA TCTGCAAAAT
TGGGAACCGG GGATGAAAGA AATTAGCATA CCTGATTTTG AGACAGGTGA GCCTATGGCG
ATCGCTCTTT CGCCTGATAA AAATGCTGTG CAGAATGCCC AAAATCTCTA CAAACAACAC
CAAAAACTCA AACGCGCCCG CATAGCCGTC GAACCGCTAC TGCAAGAAGT ACAAGCAGAA
ATCGATTATT TAGAACAAGT AGAAGCTGCT ATTGCCCAAA TAGATAACTA TCAAACAGCA
GAAGATTTGC AAGCTTTAGA AGAAATCCGC GACGAATTAA TTGGACAGAA ATATTTAGAA
GAGTTAGAGT ATCGTAGCCG CAATAACAAC GAAACTGCTA GCACTAACTT TCACAACTAT
CGTACCCCTA ATGGCTTCAC AGTCTTAATC GGGCGCAACA ATCGCCAAAA TGACCAATTA
ACATTTCGAG TAGCCGGAGA TTATGATTTA TGGTTCCATG CCCAAGAAAT CCCCGGAAGC
CATGTACTAC TACGTTTAGA ACCGGGTGCA ATACCAGAAG TATCAGACTT ACAATATGTA
GCTGATTTAA CAGCTTACTA CAGTCGCGGT CGTCAGAGTG ACCAAGTACC AGTCGTTTAC
ACCCAACCCA AACACGTTTA TAAACCCAAA GGAGCTAAAC CAGGAATTGC TATTTACAAA
CAGGAACGCA TCCTTTGGGG AAAACCGCAG TTAGTAGATA TAGAGAAAGT AGGAAGCTGA
 
Protein sequence
MQPVDFTTLT ATCSELRAHW LPSRLEQVYQ RDRYTIAIAL RTLDKRGWLQ ISWHPQATHI 
CIGDPPPRTP DTFTFSQQLV HQLGGLALVA IEAIAPWERV IDLQFARRPG DAALYHIYGE
IMGKYSNVIL TDANNLIITA AHQVSQQQSS VRPILTGQPY ETPPKLTGTI PSLQETQARW
QERVSLVPGA IKRQLLKSYS GLSAVLVESM LLVANIAPET STDSLTPEDW QRLFARWQEW
LHTLNSGKFQ PAWMADGYTV MGWGAVAPVK DIQTLINQYY TKQLNQQLFA QLRHQLNQKL
SNILGKLRNK AQTFSDRLQQ SDRADEYRQK ADLLMAHLQN WEPGMKEISI PDFETGEPMA
IALSPDKNAV QNAQNLYKQH QKLKRARIAV EPLLQEVQAE IDYLEQVEAA IAQIDNYQTA
EDLQALEEIR DELIGQKYLE ELEYRSRNNN ETASTNFHNY RTPNGFTVLI GRNNRQNDQL
TFRVAGDYDL WFHAQEIPGS HVLLRLEPGA IPEVSDLQYV ADLTAYYSRG RQSDQVPVVY
TQPKHVYKPK GAKPGIAIYK QERILWGKPQ LVDIEKVGS