Gene Ava_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0234 
Symbol 
ID3682998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp296554 
End bp297828 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content43% 
IMG OID637715562 
Producttransposase IS4 
Protein accessionYP_320755 
Protein GI75906459 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAGCTC GTTTCCAAGA ACTAGGTTGG CTCAAAAATC GCGGCCGTGT CAGAACTGAT 
TCAACTCACG TATTAGCCGC AGTACGACAG TTAAATCGTT TGGAATTAGT GGGAGAAACT
TTACGTCATA CCTTAAATGA CTTGGCTTAT TTTGCCCCTG ATTGGCTCAA ATCGAGAGTT
GACGTTGATT GGTTTGAACG TTACTCCCTG AGATTTGAGC AATACCGCTT GCCCAAATCA
AAAGCCGAAC GTGAGAAATT GAGGCGAAAA ATTGGTGAGG ATGGTCATCA TTTGCTATCC
GCTTTGTATG CAGACTCAAC TTGTAATTGG CTGTGGCAGA TTCCATCAGT GGAAACATTA
CGTATAGTTT GGGTGCAACA ATACTATATT CAATTGCAAC AAGTCTATTG GCGAGAACAA
GATAACTTAC CACCAAATAG ACTACAGATT GAATCTCCTT ACGATGTTGA TGCACGCAAT
TCCAGCAAGC GAGAAATCAA CTGGACTGGT TATAATCTGC ATCTGACAGA AATTTGTCAC
CCCATACTGC CAAACTTAAT TATCAATGTG GAAACGTCCG TGGCCACAAG TGCGGATGTT
GAGATGACAC CAGTAATTCA TTCTCGTTTA AACCAGAACA ATCTTTTGCC ACAAGAACAT
GTTGTCGATA CTGGCTATGT CAATGCTCAA AACTTAGTCG ATAGTCAATC CCATTTTCAT
GTTGATTTAG TAGGAAAAGT TCCCCCCGGA ACTAGTTGGC AAGCAACAGC ACAATCCGGC
TTTGAGCAAA ATTGCTTCAC TATTCATTGG GATTTGATGC GTGTTGATTG CCCAATGGGT
AAACAAAGTA AGTCCTGGCG TACAACTGTC GATAGCCATG ACAATCCAGT AGTCAAAATA
CAATTTGACA AATCCGATTG TTCGCTTTGT TCAAGTCGCT CAAAATGCAC TCGCTCCAAA
AAACTACCGC GTCTTCTGAC CCTCAAACCA CAGGAACTAC ATCTTGCATT ACATGATGCT
CGCATTCGCC AAAAAACTGA ATCTTTTCAA CAAATTTATC ACCAACGTGC TGGCGTTGAA
GGCTTGATTT CCCAAGCTAC TGGTCGCTAC CAATTACGCC GTTGTCGCTA CATTGGTCTT
GCCAAAACTC TCTTGCAGCA TGTCATTACT GCTGCTGCTA TCAACTTCAG TCGGATGTGG
GATTGGTGGC AACATGTCCC ACGCAGTCAG ACTCGCGTTT CTCACTTTGC TCGAATTGCT
CCCACTGCCT CATAG
 
Protein sequence
MLARFQELGW LKNRGRVRTD STHVLAAVRQ LNRLELVGET LRHTLNDLAY FAPDWLKSRV 
DVDWFERYSL RFEQYRLPKS KAEREKLRRK IGEDGHHLLS ALYADSTCNW LWQIPSVETL
RIVWVQQYYI QLQQVYWREQ DNLPPNRLQI ESPYDVDARN SSKREINWTG YNLHLTEICH
PILPNLIINV ETSVATSADV EMTPVIHSRL NQNNLLPQEH VVDTGYVNAQ NLVDSQSHFH
VDLVGKVPPG TSWQATAQSG FEQNCFTIHW DLMRVDCPMG KQSKSWRTTV DSHDNPVVKI
QFDKSDCSLC SSRSKCTRSK KLPRLLTLKP QELHLALHDA RIRQKTESFQ QIYHQRAGVE
GLISQATGRY QLRRCRYIGL AKTLLQHVIT AAAINFSRMW DWWQHVPRSQ TRVSHFARIA
PTAS