Gene Information |
Locus tag | Ava_C0011 |
Symbol | |
ID | 3677776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007412 |
Strand | + |
Start bp | 30108 |
End bp | 31712 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637715095 |
Product | extracellular solute-binding protein |
Protein accession | YP_320289 |
Protein GI | 75812672 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
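
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    return gene_len_bp // 3 - 1


# Values taken from the record for Ava_C0011.
length_bp = gene_length(30108, 31712)
print(length_bp)                  # 1605
print(protein_length(length_bp))  # 534
```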
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0146567 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACACT TCAAGGTACT ATTTGGCATC ATCCTCATAC AAATTTTGGT GGGTTGTAAT TTAGCTACAC CTAAGCGAGT AGCTAACAAT CCTTCTCAGC CAAAAAAGCA AATTGTGATT GCTTTGGGGT GGGGAGATTC ACCAACAGGG TTTGATCCGA CATTGGGTTG GGGTTATCAT GACCCTCCTT TGTTCCAAAG TACCTTGGTA CGTCGGGACG AAAATCTGCA ATTAGTTAAT GATTTGGCTA AAAGTTATAC TTTGAGTCCT GATAAAAAAG TCTGGAGGTT TAAAATCCGT CCAGATGTGC GGTTTTCTGA TGGTAAAATG CTAACAGCCG CAGATGTCGC CTATACATTT AATCAAGCAA AAGCCAGTCC TGGTCTAACC GATGTGACAA TTTTGGATAA AGCTGTAGCG AAGAGTGCTT ATGAAGTAGA ACTATATCTG AAGCAGCCGC AAATTACCTT TATTAATCGC ATCGCCCAAT TGGGAATTGT CCCCAAACAT CTGCATAATC AAAACTATGG TCGAAATCCC ATTGGTTCAG GCCCTTATCG TTTGGTGCAA TGGGATGAAG GTCAACAAAT GATTGTTGAA GCTAATCCCG ATTACTACGG CGAACAACCA GAAATCAAAA AGATTGTATT TTTGTTTACT AGAGGAGATG CGGTATTTAC GGCTGCCAAA GCGGGAGAAC TGGACTTAGC ACAAATTCCG CCTTTTTTAG CCAAGCAGTC AGTTACAGGA ATGAATTTGT ATGCCATCAA CAGCAATAGT CGTGTAGGCT TGATGTTTCC ATATCTTCCC AATACAGGAC GTAAAACTAC TGAGGGAAAT CCCATTGGTA ACAATGTCAC AGCAGACCGA GCCATTCGCC AAGCGGTGAA TTACGCAATT AATCGCCAAG CTTTAGTTAC AGGTATTTTA GAAGGTTATG GTTCACCAGC TTATGGTGCA GCCAGTAAAT TACCTTGGGA TCAACCACAA GCGGCGATCG CAGATGGAAA CCCCGACAAA GCAAAACAAA TTTTAAGCGC AGGAGGTTGG AGAGATAGCA ACGGGGATGG GGTATTAGAA AAAGCGGGGA TGAAAGCAGA ATTTACGATT CTCTACCCTG TAAGTAATCC TACCTCCCAA GGTCTAGCAT TAGCGATCGC CCAAATGCTC AAACCTGTAG GTATCAAAGT TAACGTTGAC GGTAAAAGCT GGGAAGATAT TTCGCGGCGA ATGCACCAAG ATGTAGGGCT GTTTCCCTGG GGAATATACG ACCCAATGGA GTTATACATT CTTTACCACA GTTCGGCAGC CCAAGGAAAC TGGCGTAACT CCGGTTACTA TTCTAATCCC CAGGTTGACC AAGCACTCGA CAAAGCAATG GCCGCCGCAT CAGAAACAGC AGCTTTGCCG TTTTGGCAAC AGGCGCAATG GAACGGTCAA ACTGGGACAG TCACCATAGG AGATGCAGCA TCAGCTTGGC TAGTCAACCT GGAGCAAATT TATCTTGTGA GTTCTTGTTT AGATATTGGT AGACCAATTC AGGCTAGTAA TTACACTGGC TCAATTATGA TCAATATTAC TAAGTGGAAG TGGATATGTA ATTAG
Protein sequence | MKHFKVLFGI ILIQILVGCN LATPKRVANN PSQPKKQIVI ALGWGDSPTG FDPTLGWGYH DPPLFQSTLV RRDENLQLVN DLAKSYTLSP DKKVWRFKIR PDVRFSDGKM LTAADVAYTF NQAKASPGLT DVTILDKAVA KSAYEVELYL KQPQITFINR IAQLGIVPKH LHNQNYGRNP IGSGPYRLVQ WDEGQQMIVE ANPDYYGEQP EIKKIVFLFT RGDAVFTAAK AGELDLAQIP PFLAKQSVTG MNLYAINSNS RVGLMFPYLP NTGRKTTEGN PIGNNVTADR AIRQAVNYAI NRQALVTGIL EGYGSPAYGA ASKLPWDQPQ AAIADGNPDK AKQILSAGGW RDSNGDGVLE KAGMKAEFTI LYPVSNPTSQ GLALAIAQML KPVGIKVNVD GKSWEDISRR MHQDVGLFPW GIYDPMELYI LYHSSAAQGN WRNSGYYSNP QVDQALDKAM AAASETAALP FWQQAQWNGQ TGTVTIGDAA SAWLVNLEQI YLVSSCLDIG RPIQASNYTG SIMINITKWK WICN
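
The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how such a string might be cleaned, its GC fraction computed, and the gene translated. It is not the database's own method. The helper names are hypothetical. The codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code; table 11 differs in which codons may act as start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons are not handled specially (a simplification);
    a trailing partial codon is ignored.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence from the record.
print(translate("ATGAAACACT TCAAGGTACT"))  # MKHFKV
```

Applied to the full gene sequence, `translate` can be compared against the protein sequence field, and `gc_content` against the GC content field.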