Gene Ava_3071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3071 
Symbol 
ID3681051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3810486 
End bp3811655 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content46% 
IMG OID637718416 
Productinosine/uridine-preferring nucleoside hydrolase 
Protein accessionYP_323575 
Protein GI75909279 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1957] Inosine-uridine nucleoside N-ribohydrolase 
TIGRFAM ID[TIGR02595] PEP-CTERM putative exosortase interaction domain 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.474461 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAAA TTCCTAATGT GCAGAAGTTG TTTTCTACTG CTGCTTCTTT AGTATCAATC 
ACAGCAATTT TTTGTAGCCA ACCTGTACTT GCAGCCTCTT TTAAGCCAAC CCCCCTAATC
ATCGACGACG ATGGCAGCCA AGACGGCATG ACTGCATTGG CTTATATGCT AGCCAATCCC
AAATTTGATG TCCAAGCAAT TACCATCGCC CAAGGTATAG CCCGCCCAGA AAGCTTTGTG
AACAACCTGG AACGGATGCT AGGCAGACTA AATGCTTCTG GCATCCCTGT TGGTATCGGC
AGATCCACTC CCCTGGCAGG AAATAATACT TTCCCAGAAT TTATTCGCAC TGGTGCAGAC
ACTTTTTGGT CTCCCTTCGT CCAACTACCT GATACAGCAC CACCTATAGT AACTCGACCA
GCCGCAGAAC TGATTGTGGA GAAAGTGAAG CAGTCATTAG CACCTGTAGC AATCTTGGCA
ACTGGATCTT TAACCAATAT TGCTGAAGCA TTACGGCTTG ACCCCACCAT TATCAACAAC
ATTGCCATCA TCGAAATCAT GGGAGGCGCA GTTTTCGTAC CTGGAAATCT CCCAGTCCTG
CCTGATCCCC CATTTTCTAC CAACACGACA GCTGAGTTCA ACATCTGGGT TGACCCTTTA
GCAGCACAAG AAGTATTTGC AGCCGGAGGG CAAGGATTAA AAATTCAGTT GACCCCCCTG
GATGCTACAA ACCAGATTGC CTTTTCTCGT GCCGATCAAC AAGCATGGCT AGCTACTGCA
ACACCAGAAA GTAAGTTAGC AGCAGAATTT TTAGACTTTG CCTTGACCAT AATTCAAAGT
AACAATGACC CCAACCCAGC TTGGGATCTA GTTGCAGCCA TTAACTTGAG TGAACCAGAT
TTCTCAGTAG AAACTCCTTT ATACTTAGAA GTTGATACGA CCTCAGATCC TGGGGGTACT
CAAGGGCAAA CTCGTGCTAT TTCTAATTTG CCCCCCAATG TTCTAGTTTC CCTCAACCCC
AGTTTTAATA ATTTGCCCTT TCGACCAGGC CAAGTCTTCT CTTACCTACA AACCCAGTCT
GTTCCCGAAC CAACATCAAT TGCAGGAATC TTACTTCTAG CCACAGTCAG TGCTGGTATG
ATGGCGCGAC GTTCTCAGAA AAAAGTTTAG
 
Protein sequence
MLKIPNVQKL FSTAASLVSI TAIFCSQPVL AASFKPTPLI IDDDGSQDGM TALAYMLANP 
KFDVQAITIA QGIARPESFV NNLERMLGRL NASGIPVGIG RSTPLAGNNT FPEFIRTGAD
TFWSPFVQLP DTAPPIVTRP AAELIVEKVK QSLAPVAILA TGSLTNIAEA LRLDPTIINN
IAIIEIMGGA VFVPGNLPVL PDPPFSTNTT AEFNIWVDPL AAQEVFAAGG QGLKIQLTPL
DATNQIAFSR ADQQAWLATA TPESKLAAEF LDFALTIIQS NNDPNPAWDL VAAINLSEPD
FSVETPLYLE VDTTSDPGGT QGQTRAISNL PPNVLVSLNP SFNNLPFRPG QVFSYLQTQS
VPEPTSIAGI LLLATVSAGM MARRSQKKV