Gene Ava_1810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1810 
Symbol 
ID3681977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2254064 
End bp2255851 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content41% 
IMG OID637717150 
Productextracellular solute-binding protein 
Protein accessionYP_322327 
Protein GI75908031 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATATA CAAATAATCT CATTTCCTTA ATTAAGCGTT TTTGGATACT CATAATTTTG 
GCTGCATTTA CAGCAGTTAC AGTTGCAGCT TGTAACCCAT CGAATTTTAA AAGTTCGGCT
GCTCAAATCC CGCAATTAGT AACTAGTATT CTCAGTGATC CTAAAACTTT TAACTATCCT
TTAAGTTCGG AATCACCTAA TGTTTTTGGT TTGATTTATG AGGGATTAAT CAGCGAAAAT
TATGATACTG GTGAAGTGGA ACCAGCTTTA GCAGAATCTT GGACAATTTC TGATGATAAA
TTAAAAATTG TTTTTACTCT CCGTGAGGGT TTAAAGTGGT CGGATGGGCA ACCACTAACT
GTAGATGATG TTGTATTTAC TTACAATGAC ATTTACTTTA ACGAAGCCAT TCCTACAGAT
GTTAGAGATA TTATGAGGAT TGGTGAAAGT CGGAAACTGC CAACTGTGAG AAAAGTTGAT
AGTCGTCGAG TTGAGTTTGC TGTTCCCGAA CCGTTTGCGC CTTTTTTACG TAGTGCGACG
AGTGCGGCAA TCTTACCAGC CCATGCACTG CGAGAATCTA TACAAACCAA AGATAGTGAG
GGTAAGCCTA AGTTTCTGCA AAAATGGGGG GTAGATACAC CACCAGACCA AATCGTCGGG
AATGGTTTGT ACAAATTGGA GCGTTATGAC ACCAGTGAAC GTGTAGTTTT CCGACGTAAT
CCTTACTATT GGCGTAAAGG GCCTAAAGGT GAAGCTCAAC CTTATATTGA ACGATTAGTG
TGGCAAATTG TCGAATCAAC AGATACCTCG TTACTCCAGT TTCGCTCTGG TGGTTTGGAT
AGTATTGGTG TTTCCCCAGA CTATTTTTCT CTGCTGAAGG TGCAAGAAAA GCAAGGCAAT
TTCAAGATAT ATAATGGCGG CCCGGCGGCT GGGACAACTT TTATATTATT TAACTTGAAT
AAGGGTCAAA GAGACGGTAA ACCACTGGTT GATCCAGTAA AGTCTCGTTG GTTTAATACG
GTAGAATTTC GCCAAGCTGT GGCTTATGCA GTTGACCGCC AAACGATGAT TAATAATATT
TATCGGGGTT TGGGTCAAAC GCAAGATTCA CCAATTTCTG TGCAGAGTCC TTATTATTTG
TCGCCCAAAG AAGGGTTAAA GGTTTACGAT TACAACTTAG AAAAAGCCAA GCAATTATTA
TTGAAAGCGG GCTTTAAATA TAATGCTCAA AATCAGTTGT TAGACTCTGA CGGTAATCGA
GTCCGCTTTA CACTGCTGAC GAATGCTGGT AACAAGATTC GTGAGGCCAT GGGTTCGCAA
ATTAAACAGG ACTTGAGCAA AATCGGCATA CAGGTAGATT TTACACCCTT GGCATGGAAT
ACTTATACAG ACAAGCTGGC GAATACTTTA GATTGGGAAG CTTCTATGCT GGGTTTGACT
GGCGGTTTAG AACCGAATGA TGGTGCTAAC GTCTGGAATC CCGAAGGGGG ATTACATATG
TTTAACCAAA AGCCCCAAGC AGGACAAAAA CCCATCACAG GTTGGGAAGT AGCACCGTGG
GAAGCGAAAA TTGGTCAACT ATACATTCAA GGTGCTAGGG AGTTGGACGA AACCAAACGT
AAAACAATCT ATGCCGAAAC CCAAAAAATC ACTCAAGAGA ATTTACCATT CATTTACTTG
GTAAATCCAT ATTCATTATC CGCAGTACGC GATCGCTTTG CAGGTATTCG CTTCTCAGCA
CTAGGCGGCG CATTCTGGAA CTTGTACGAA ATCAAAATAG TTAAGTAG
 
Protein sequence
MPYTNNLISL IKRFWILIIL AAFTAVTVAA CNPSNFKSSA AQIPQLVTSI LSDPKTFNYP 
LSSESPNVFG LIYEGLISEN YDTGEVEPAL AESWTISDDK LKIVFTLREG LKWSDGQPLT
VDDVVFTYND IYFNEAIPTD VRDIMRIGES RKLPTVRKVD SRRVEFAVPE PFAPFLRSAT
SAAILPAHAL RESIQTKDSE GKPKFLQKWG VDTPPDQIVG NGLYKLERYD TSERVVFRRN
PYYWRKGPKG EAQPYIERLV WQIVESTDTS LLQFRSGGLD SIGVSPDYFS LLKVQEKQGN
FKIYNGGPAA GTTFILFNLN KGQRDGKPLV DPVKSRWFNT VEFRQAVAYA VDRQTMINNI
YRGLGQTQDS PISVQSPYYL SPKEGLKVYD YNLEKAKQLL LKAGFKYNAQ NQLLDSDGNR
VRFTLLTNAG NKIREAMGSQ IKQDLSKIGI QVDFTPLAWN TYTDKLANTL DWEASMLGLT
GGLEPNDGAN VWNPEGGLHM FNQKPQAGQK PITGWEVAPW EAKIGQLYIQ GARELDETKR
KTIYAETQKI TQENLPFIYL VNPYSLSAVR DRFAGIRFSA LGGAFWNLYE IKIVK