Gene Ava_1563 details

Gene Information

Locus tag: Ava_1563
Symbol:
ID: 3681106
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1924290
End bp: 1926056
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 41%
IMG OID: 637716903
Product: extracellular solute-binding protein
Protein accession: YP_322081
Protein GI: 75907785
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
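
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the terminal stop codon


# Values copied from the record above.
length = gene_length_bp(1924290, 1926056)
print(length, protein_length_aa(length))
```

With the values listed in the record, the computed lengths agree with the reported 1767 bp and 588 aa.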


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.176735
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.279007
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTATCT TGAATAAATT TCGTCAGGGT CAAATTTTTA CTTGGTCGTT ACTAAATATA 
TTTTTCCTTG CTGGTTGTTC TTTTCCTCAG GCGGAAACTC CTACAAACTC CACACCCGTA
ACAAACACTA GTACAGGCGA AACCTTACGT CTACTTTATT GGCAAGCACC AACTATTCTC
AATCCTCACC TAGCGCAGGG AACTAAAGAC TTTGAAGCTA GTCGTATCGT CTATGAACCC
CTCGCCAGCC ACGATAAAGA CGGTAAATTG GTTCTGTTTT TAGCCGCAGA GGAACCTACT
CTAAAAAATG GTGGTATAGC CAAAGATGGT AAATCAGTTA CCTGGAAACT CAAGCAAGGA
GTCAAATGGT CTGATGGTCA ACCTTTTACA GCTGCGGATG TGGTATTTAC TTACAAATTT
CTTTCCAATC CGGCTGTCGG TGCTACCACT TCCGCTAATT ACGAAGCTGT GCAAAGCGTC
GAAGCCATCG ACGATTACAC TGTCAAAATT AATTTCCAGA GTCCTAACCC AGCTTGGTCA
CTACCTTTTG TGGGCTTAAA TGGAATGATT ATTCCCCGTC ACATTTTTGA GAAATTTAAC
GGTAGTAACG CTAGGGAAGC GCCAGGTAAT TTGATTCCTA TAGGTACAGG CCCTTATAAA
GTCGGAGAAT TTAAACCCGG TGACACCATT ATCTATGAAG CTAATTCTGT GTTCCGTGAA
GCCAATAAAC CTTTCTTTAA GCGAGTAGAA CTCAAGGGAG GTGGTGATGC GACATCAGCC
GCGAGAGCAG TACTACAAAC TGGAGATGTA GACTACGCTT GGAACCTGCA AGTAGAAGCC
CCGATTCTCA AGCAACTAGA AGCAGCAGGC AAAGGGAAAT TAAAAATTAG TTTTGGTTCT
TTTTTAGAAC GGATTACCAT CAATCATACC GACCCTAATA AACAAACAAA AGACGGCGAA
CGTTCTAGCA CTGAATTTCC TCATCCATTT TTTCAAGACA TCAAAGTGCG TCAAGCATTT
AACTATGCAA TTGATCGGGA CACAATAAAT CAACAATTAT ATGGTTCTAG TGGTCGTCCT
GCCGCCAATA TCCTATTAGC ACCAGAGATT TATAACTCAC CTAATACTAA ATATGAATTT
AGCCCCAAGA AAGCTACTGA TTTATTAGAT GAAGCTGGAT GGAAAGATAC AAATGGTAAT
GGTATTCGAG ACAAAAATGG TGTAGAGATG AATGTTTTGC TTCAGACATC TGTAAATCCA
GTGAGACAGA AAACTCAGGA AATTATTAAA CAAGGATTAA CCTCTATTGG TGTTGGGGTG
GAACTAAAAA GTATTGATGG TAGTATCTTC TTTTCTGGAG ACCCATCGAA CCCAGACACC
TTGGGAAGGT TTCAAGCTGA TTTACAAATG TTTAGTACGG GTAGTACGAA TGTAGATCCT
GGTGCTTATA TGAAAGGCTT TACTTGTAGC GAAATTCCCC AGAAAAAGAA TAACTGGTCA
AAATCCAATC ATTCACGTTA CTGTAATCCT GAATATGATA AGCTCTGGCA ACAGTCCAAC
ACAGAATTAA ATCCTGAAAA ACGTCGGCTG CTATTTATTC AGATGAATGA TCTGCTATTC
AAAGATATTG CTTTAATTCC CTTGATTGCC CGTGCTGATG TCAATGGCGT GAGCAATAGA
CTGGTCGGTG TAGATTTGAC CCCTTGGGAT ACTGATACAT GGAATATTAA AGATTGGCAA
CAAGTCCAGT CTCTTGGTAA TAGGTAA
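
The GC content field in the record can in principle be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases once whitespace is removed; `gc_fraction` is an illustrative name, and the example input is only the first block of the gene sequence, not the full gene.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace and case."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence above (3 G + 1 C out of 10 bases).
print(gc_fraction("ATGGGTATCT"))
```

Passing the whole gene sequence (all 30 lines, spaces and line breaks included) to the same function would give the gene-wide value to compare against the reported percentage.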
 
Protein sequence
MGILNKFRQG QIFTWSLLNI FFLAGCSFPQ AETPTNSTPV TNTSTGETLR LLYWQAPTIL 
NPHLAQGTKD FEASRIVYEP LASHDKDGKL VLFLAAEEPT LKNGGIAKDG KSVTWKLKQG
VKWSDGQPFT AADVVFTYKF LSNPAVGATT SANYEAVQSV EAIDDYTVKI NFQSPNPAWS
LPFVGLNGMI IPRHIFEKFN GSNAREAPGN LIPIGTGPYK VGEFKPGDTI IYEANSVFRE
ANKPFFKRVE LKGGGDATSA ARAVLQTGDV DYAWNLQVEA PILKQLEAAG KGKLKISFGS
FLERITINHT DPNKQTKDGE RSSTEFPHPF FQDIKVRQAF NYAIDRDTIN QQLYGSSGRP
AANILLAPEI YNSPNTKYEF SPKKATDLLD EAGWKDTNGN GIRDKNGVEM NVLLQTSVNP
VRQKTQEIIK QGLTSIGVGV ELKSIDGSIF FSGDPSNPDT LGRFQADLQM FSTGSTNVDP
GAYMKGFTCS EIPQKKNNWS KSNHSRYCNP EYDKLWQQSN TELNPEKRRL LFIQMNDLLF
KDIALIPLIA RADVNGVSNR LVGVDLTPWD TDTWNIKDWQ QVQSLGNR
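
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The following is a minimal sketch, assuming the amino-acid assignments of translation table 11 (which match the standard genetic code; the tables differ in which codons may act as start codons) and assuming the first codon is read as methionine. The names `CODON_TABLE` and `translate` are illustrative and not part of the record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base block separators used above) is ignored.
    """
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines' worth of codons from the gene sequence above.
print(translate("ATGGGTATCT TGAATAAATT TCGTCAGGGT"))  # MGILNKFRQG
print(translate("CAAGTCCAGT CTCTTGGTAA TAGGTAA"))     # QVQSLGNR
```

The two fragments reproduce the first ten and last eight residues of the protein sequence listed above.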