Gene Ava_C0022 details


Gene Information

Locus tag: Ava_C0022
Symbol:
ID: 3677787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007412
Strand:
Start bp: 38644
End bp: 39768
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 39%
IMG OID: 637715106
Product: IS891/IS1136/IS1341 transposase
Protein accession: YP_320300
Protein GI: 75812683
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
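
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 38644
end_bp = 39768

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "1125 374"
```

Both numbers agree with the Gene Length (1125 bp) and Protein Length (374 aa) listed in the record.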


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0682422
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACAC TGAAGTTTAA GCTATATCAA CACAAAAGAA ATAGACACCT TAAGCGCATT 
ATTAATGCGG CTGGGGTAAT CTACAATCAT TGCATTGCTC TACACAAACG CTACTACAGA
ATGTGGGGCA ATCACTTGAG TTGTGCAAAA CTTCAGTCTC ATATTGCCAA ATTAAGAAAA
CGTAATTTAT TCTGGCAATC GCTAGGTTCT CAAGCAGTAC AAGATATCTG TCAACGCATA
GAGAAAGCCT ATCAACTATT TTTTAAACAC AATAAAAAAG GAGTTAGACC ACCAGGATTT
AAAAAGGTTA AAAAATACAA ATCATTCACT CTCAAACAAG CTGGTTATAA GTTTTTAGGT
AGCAACAGGG TGAAAATTGG GCATCGAGTA TATCAATTTT GGAAGTCTAG AGACATTGAG
GGAACAGTTA AAACCTTAAC CATTAAACGC ACACCATTAG GTGAATTGTT TATGGTTGTA
GTGGTTGATA ATGTTAGTCA AGCAGAAATT GAAGTTAAGA CGGGTAAAAT CGCTGGCTTT
GATTTTGGGT TGAAAACATT CCTCACTTGC TCAGACGGTA CTAAAATTGA ATCACCCCAA
TTTTTCAAGC AGTCACTCAA CGCCATCAAA AAAGCCAGCA GACAGCATTC CAAAAAACTA
AAGAGTTCAT CCAACAGGGA GCGAGCTAGG AAAAATTTGG TACGCAAATA TGAAGATATT
TCTCATCGTC GGCGTGATTG GTTTTGGAAA TTGGCCCATG AACTAACAGA TAAGTTTGAT
ATACTGTGTT TTGAGACGCT GAACCTCAAG GGAATGCAAC GACTTTGGGG AAGAAAAATA
TCAGATTTGG CGTTTGGCGA GTTCCTACAA ATTTTAAAAT GGATTGCTAA AAAGAAGAAT
AAACTGGTTG TTTTCATCGA CCAGTGGTAT CCATCCACTA AGACTTGCTC TGGCTGTGGA
CACGTTCTAG AAGAGTTGGA TTTATCTATT AGAGAATGGC GTTGCCCATC TTGCCAATCA
GTAAATGGAA GGGATGAAAA CGCATCTAAA GTAATTTGTG CAGTCGGGGC ATCGACTGTT
GGGTTAGGGG ATGTAAGTCG GTATGAAACT GCTATCGCTG TTTGA
 
Protein sequence
MKTLKFKLYQ HKRNRHLKRI INAAGVIYNH CIALHKRYYR MWGNHLSCAK LQSHIAKLRK 
RNLFWQSLGS QAVQDICQRI EKAYQLFFKH NKKGVRPPGF KKVKKYKSFT LKQAGYKFLG
SNRVKIGHRV YQFWKSRDIE GTVKTLTIKR TPLGELFMVV VVDNVSQAEI EVKTGKIAGF
DFGLKTFLTC SDGTKIESPQ FFKQSLNAIK KASRQHSKKL KSSSNRERAR KNLVRKYEDI
SHRRRDWFWK LAHELTDKFD ILCFETLNLK GMQRLWGRKI SDLAFGEFLQ ILKWIAKKKN
KLVVFIDQWY PSTKTCSGCG HVLEELDLSI REWRCPSCQS VNGRDENASK VICAVGASTV
GLGDVSRYET AIAV
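
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how they could be joined and checked against each other follows. It assumes the standard codon-to-amino-acid mapping, which translation table 11 shares with table 1 for elongation codons (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `join_blocks`, `gc_content` and `translate` are hypothetical and not part of any library.

```python
# Codon table laid out in the conventional TCAG order (first, second, third base).
CODON_BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def join_blocks(text):
    """Remove the spaces and line breaks used to print a sequence in blocks."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Raises ValueError if a codon contains a character other than T, C, A or G.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * CODON_BASES.index(codon[0])
                 + 4 * CODON_BASES.index(codon[1])
                 + CODON_BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence listed above.
print(translate(join_blocks("ATGAAGACAC TGAAGTTTAA G")))  # prints "MKTLKFK"
```

Passing the full gene sequence through `join_blocks` and `translate` should reproduce the protein sequence listed above, and `gc_content` on the joined gene sequence can be compared with the GC content field of the record.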