Gene Ava_3774 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3774 
SymbolileS 
ID3678978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4697681 
End bp4700617 
Gene Length2937 bp 
Protein Length978 aa 
Translation table11 
GC content46% 
IMG OID637719124 
Productisoleucyl-tRNA synthetase 
Protein accessionYP_324274 
Protein GI75909978 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0060] Isoleucyl-tRNA synthetase 
TIGRFAM ID[TIGR00392] isoleucyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.229879 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.244596 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATCTT CTATAATTGA TAGAACCTGC ATACATAATC TTAGGCATGA AGCTGTGACA 
GAAACTGGAA GCTACAAAGA TACCGTAAAT TTACCCAAGA CAAATTTTGA TATGCGGGCG
AACGCCATCA AGCGCGAACC AGAAATCCAA AAGTTTTGGG AAGAAAATAA AATTTTTGAA
CGCCTGTCGC AAAATAATCC AGGTGAATTA TTTATACTGC ACGATGGGCC TCCCTACGCT
AATGGCTCAC TCCATATTGG TCATGCCTTA AATAAAATTC TCAAAGATAT TATTAATCGT
TACCAACTGC TGCAAGGTCG GAAGGTGCGA TATGTGCCTG GTTGGGACTG TCACGGCTTA
CCGATTGAGT TGAAAGTTTT GCAAAATCTC AAGTCAGCAG AACGGCAAAA TTTAACACCA
CTACAACTGC GGCAAAAGGC TAAAGAGTTT GCCTTGGCTA CTGTTGATGA CCAACGCCAA
AATTTTAAAC GTTATGGTGT TTGGGGTGAC TGGGATCATC CATATCTGAC CCTGAAGCCT
GAATATGAGG CGGCGCAGAT TGGTGTGTTT GGGCAGATGG TCTTAAAAGG ATATATATAT
CGGGGGTTAA AACCTGTTCA CTGGAGTCCC AGTTCTAAGA CAGCTTTGGC TGAGGCGGAA
TTAGAATATC CAGAAGGGCA TACTTCCCGG AGTATCTATG CGGCTTTTCC GGTAACTGGT
CTGGCTGAGG CTGTTAAGTC TGTTTTGGGT GAGTATTTGC CTGATTTGGG TGTGGCTATC
TGGACTACTA CGCCTTGGAC AATTCCGGGG AATTTGGCTG TGGCCGTCAA TGGCGACTTG
AATTATGCTG TGGTGGAAGT TGCACAGATA GATGTAGAGA CGCAAAGTAA TTTCAAGTAT
CTCATCGTGG CGGCGGAGTT GGTGGAACGG TTGGCGGCGA CTATCTCGGC GCAGTTGACT
GTGAAGGCTA CTTTTAAGGG TAAGGAATTA GAACATACTA CCTACCGTCA TCCTTTATTT
GACCGGGAAA GTCCGGTGGT TGTGGGTGGT GATTACATCA CAACTGAGTC GGGTACTGGG
TTGGTACATA CTGCCCCTGG TCATGGTCAA GAAGACTACG TAGTAGGTCT GCGTTATGGT
TTGCCAATTC TTGCACCTGT GGACGACAAT GGCGATTTTA CCCAGGAGGC GGGTGAGTTT
GCGGGGTTGA ATGTGTTGGG TGATGGCAAT CAGGCGGTCA TTGATGCGTT GACGGCCGCG
GGTTCCCTGC TGAAGGAAGA AGCTTATGCA CACAAGTATC CTTATGATTG GCGGACGAAA
AAGCCGACAA TTTTCCGGGC GACTGAACAG TGGTTTGCTT CGGTGGAAGG TTTCCGGGAT
GAGGCACTAA AGGCGATCGC TACTGTAAAA TGGATACCAG CCCAAGGTGA AAATCGTATC
ACGCCGATGG TGGCGGAACG TTCTGATTGG TGTATTTCCC GTCAGCGTTC CTGGGGTGTG
CCAATTCCGG TATTCTACGA TGAGGAAACC GGCGAACCTC TGCTAAATGA GGAAACTATC
AACTATGTAC AAGCCATCAT TGCCGAAAAA GGTTCTGATG CTTGGTGGGA GTTGTCGGTA
GAGGAATTAT TACCAGAGTC CTACAGAAAT AATGGTCGGT CTTACCGCAG AGGTACAGAC
ACAATGGACG TATGGTTTGA TTCTGGTTCT TCTTGGGCTT CCGTAGTGAA GCAGCGTCCA
GAGTTACGCT ACCCGGCTGA TATGTATTTG GAAGGTTCCG ACCAACATCG CGGTTGGTTC
CAGTCGAGTT TGTTGACTAG TGTGGCGGTA AATGGCATTG CACCTTACAA AACTGTATTA
ACTCATGGCT TTGTTTTGGA TGAACAAGGA CGGAAGATGA GTAAATCAGA AGGAAATGTA
GTTGACCCAA AAATTCTCAT TTCTGGGGGT AAAGACCAGA AGAAAGAACC CCCCTATGGT
GCAGATGTTA TGCGGTTGTG GGCATCTTCT GTAGATTACA CTGGTGATGT GCGTTTGGGT
GGCAATATCA TCAAGCAACT CAACGATGTC AGAGGTAAAA TTCGCAATAC GGCGCGGTTC
TTGCTGGGTA GTTTGCATGA CTTTGACCCA GAAAAAAATG CTGTACAGTT TGAGGAAATG
CCGCAGTTAG ATAGATATAT GCTGCACCGC ATCCGTGAGG TGTTTCAAGA AGTGACGGAA
GCTTTTGAAA GTTTCCAATT CTTCCGCTTT TTCCAAACGG TACAGAATTT CTGCGTGGTG
GATTTATCCA ACTTCTACTT AGATGTTGCC AAGGATAGAC TATACATCAG CGCCCCTGAT
GCTTTCCGTC GCCGCAGTTG TCAGACAGTG ATACACATTG CACTACAAAA TTTAGCACGA
GCGATCGCCC CTGTACTCTG TCACACTGCT GAAGATATCT GGCAATATCT CCCCTACAAA
ACACCATACA AATCAGTATT TGAAGCTGGT TGGGTGCAGG TAGAGAAAAA ATGGCATAAT
CCAGAGTTGG CGGAATTTTG GCAACAATTA CGCCAGTTAC GCACCGATGT TAACAAGGTG
TTAGAACAAG CTAGGGTAGA AAAAATGATT GGTTCTTCCC TAGAGGCGAA AGCTTTGATT
TACGTCAAAG ATGCCAACTC TCGCAACGCC ATCGCCACTT TAAATCCTGA AGTTGGTAAC
GGCGTAGATG AACTGCGTTA TTTATTCCTA ACATCCCAAG TAGAATTATT AGATTCTGCT
GACAAACTGC AAGATGGGAA ATATACCTCC CAGTCTGATA ACTGGGGAAT TGGGGTAGTG
AATGCAGAAG GGCAAAAATG CGATCGCTGT TGGAACTACT CCACCCATGT GGGAGAATCA
CAAGAGCATC CCCTACTCTG TGAACGCTGC GTTCCTGCCT TAGCTGGCGA GTTTTAG
 
Protein sequence
MPSSIIDRTC IHNLRHEAVT ETGSYKDTVN LPKTNFDMRA NAIKREPEIQ KFWEENKIFE 
RLSQNNPGEL FILHDGPPYA NGSLHIGHAL NKILKDIINR YQLLQGRKVR YVPGWDCHGL
PIELKVLQNL KSAERQNLTP LQLRQKAKEF ALATVDDQRQ NFKRYGVWGD WDHPYLTLKP
EYEAAQIGVF GQMVLKGYIY RGLKPVHWSP SSKTALAEAE LEYPEGHTSR SIYAAFPVTG
LAEAVKSVLG EYLPDLGVAI WTTTPWTIPG NLAVAVNGDL NYAVVEVAQI DVETQSNFKY
LIVAAELVER LAATISAQLT VKATFKGKEL EHTTYRHPLF DRESPVVVGG DYITTESGTG
LVHTAPGHGQ EDYVVGLRYG LPILAPVDDN GDFTQEAGEF AGLNVLGDGN QAVIDALTAA
GSLLKEEAYA HKYPYDWRTK KPTIFRATEQ WFASVEGFRD EALKAIATVK WIPAQGENRI
TPMVAERSDW CISRQRSWGV PIPVFYDEET GEPLLNEETI NYVQAIIAEK GSDAWWELSV
EELLPESYRN NGRSYRRGTD TMDVWFDSGS SWASVVKQRP ELRYPADMYL EGSDQHRGWF
QSSLLTSVAV NGIAPYKTVL THGFVLDEQG RKMSKSEGNV VDPKILISGG KDQKKEPPYG
ADVMRLWASS VDYTGDVRLG GNIIKQLNDV RGKIRNTARF LLGSLHDFDP EKNAVQFEEM
PQLDRYMLHR IREVFQEVTE AFESFQFFRF FQTVQNFCVV DLSNFYLDVA KDRLYISAPD
AFRRRSCQTV IHIALQNLAR AIAPVLCHTA EDIWQYLPYK TPYKSVFEAG WVQVEKKWHN
PELAEFWQQL RQLRTDVNKV LEQARVEKMI GSSLEAKALI YVKDANSRNA IATLNPEVGN
GVDELRYLFL TSQVELLDSA DKLQDGKYTS QSDNWGIGVV NAEGQKCDRC WNYSTHVGES
QEHPLLCERC VPALAGEF