Gene Ava_1829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1829 
Symbol 
ID3681937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2275287 
End bp2276498 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content42% 
IMG OID637717169 
ProductIS891/IS1136/IS1341 transposase 
Protein accessionYP_322346 
Protein GI75908050 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACAAA AAGCATTCAA GTACCGATTC TATCCAACTC CTGAGCAAGA AACCTTGCTC 
AGAAGAACAA TGGGATGTAC TCGTTTAGTT TACAACCTTG CCCTTTCTGC AAGAACGCAG
GCATGGTATG AGCATCAAGA ACGAGTCGGG TACATTGAAA CTTCAGCAAT GCTGACTAGT
TGGAAGAAGC AAGAAGACTT GCAATTTCTT AATGACGTTA GCAGTGTGCC ATTACAGCAA
AGTTTACGAC ATCTACAAAC GGCATTTAGC AACTTCTTTG CAGGTCGGAC TAAATACCCC
AACTTCAAGA AAAAGCATAA TGGTGGCAAT GCTGAATTTA CTTCATCAGC GTTTAAGTTT
AGAGATGGGC AAATATTTCT GGCTAAAAGT CCTACAGCGT TAGACATTCG TTGGAGCCAA
CAGCTACCTC AAGGTATAGA ACCATCTACT ATTACGGTAA AACTATCGCC CTCTGGACGC
TGGACTGTTT CAATGTTGGT AGATGTCGAA ATTCAAAAAT TACCTGAATC TTTAACTCAA
GTTGGCGTTG ACTTGGGTAT CACTAGCTTA GTTGCATTGA GTACAGGCGA AAAGATTAGT
AATCCCAAAA GCTTTAAAGC AAAAAAAGCG AAATTGCGTA AAGCGCAAAA AGCTTTGAGC
CGTAAACAAA AAGGATCTAA CAATCGTCAC AAAGCAAGGA TGAAGGTCGC TAAAGTTCAT
ACAGAAGTTA GTGATGCTCG TCATGATTTT CTTCACAAAT TGACAACCAG ACTGGTTCGA
GAAAACCAAT TGATCGCAGT TGAAGATTTG TCTGTGAAGA ATATGGTTAA AAACAAAAAA
CTCGCTTTTT CAATTAGTGA TGCCAGTTGG GGCGAGTTGG TCAGGCAACT TGAGTATAAG
TGCGATTGGT ATGGTCGCAC TCTCATCAAG ATTGACCGAT GGTTTCCGAG TTCTAAGCGA
TGTGGCAATT GTGGGCATAT CGTTGAAAAA CTGCCATTGA ATGTACGAGA GTGGGATTGC
CCTAAATGTC AGGCACACCA CGACCGAGAC ATCAACGCCA GTAAGAATAT TTTGGCTGCG
GGACTCGCAG TTTCAGTCTG TGGAGCGAAC ATAAGACCTG ACAGACTTAA GTCTCAAGGG
CAGTTGCAAA AAACCCGTAA GGCTTGCCTT GAGCGTAGCC GAAAGGGACA GAAACAGAAA
CCTAAGTCGT GA
 
Protein sequence
MTQKAFKYRF YPTPEQETLL RRTMGCTRLV YNLALSARTQ AWYEHQERVG YIETSAMLTS 
WKKQEDLQFL NDVSSVPLQQ SLRHLQTAFS NFFAGRTKYP NFKKKHNGGN AEFTSSAFKF
RDGQIFLAKS PTALDIRWSQ QLPQGIEPST ITVKLSPSGR WTVSMLVDVE IQKLPESLTQ
VGVDLGITSL VALSTGEKIS NPKSFKAKKA KLRKAQKALS RKQKGSNNRH KARMKVAKVH
TEVSDARHDF LHKLTTRLVR ENQLIAVEDL SVKNMVKNKK LAFSISDASW GELVRQLEYK
CDWYGRTLIK IDRWFPSSKR CGNCGHIVEK LPLNVREWDC PKCQAHHDRD INASKNILAA
GLAVSVCGAN IRPDRLKSQG QLQKTRKACL ERSRKGQKQK PKS