Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_1027 |
Symbol | |
ID | 3678695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 1243574 |
End bp | 1244950 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637716363 |
Product | bicarbonate transport system substrate-binding protein |
Protein accession | YP_321546 |
Protein GI | 75907250 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.181538 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGAGT TTTTTAATCA ATTTTCTCGC CGCAAGTTTA TAGTTACAGC AGGAGCTTCG GCAGGTGCAG TGTTCCTCAA AGGTTGCTTG GGAAATCCGC CTGAGACTAC CGGAGGAACA CAATCTGCAC CAACTGCTCA ACCTGCTGCT AATGTTAGCG CAGAGCAAGC ACCAGAAGTC ACTACTGTGA AGTTGGGATA TATTCCCATT GTGGAATCGG CTCCTTTAAT TATTGCCAAA GAAAAAGGTT TCTTTGCTAA ATATGGATTA ACTAATGTAG AACTTTCTAA ACAAGCTTCT TGGGGTTCTG CTAGAGATAA CGTAGAAATT GGTTCGGCTG GTGGTGGTAT TGATGGCGGT CAATGGCAAA TGCCTATGCC ACACTTGATT ACCGAAGGTT TAATTACTAA GGGTAATCAA AAGATACCCA TGTATGTGTT ATGTCAATTA ATTACACATG GGAATGGAAT TGCGATCGCT AACAAGCACC AAGGTAAAGG TATCAGTTTA AAATTAGAAG GCGCTAAGTC TTTATTTAGC CAACTCAAGT CTTCTACACC CTTCACTGCC GCTTTCACTT TCCCCCACGT CAACCAAGAC TTATGGATTC GCTACTGGTT AGCCGCAGGC GGTATTGACC CAGATGCAGA TGTCAAACTG CTGACAGTCC CGGCGGCGCA AACTGTAGCT AACATGAAAA CCGGCACAAT GGACGCTTTC AGCACAGGTG ACCCCTGGCC ATTCCGCTTG GTCAACGACA AAATTGGCTA CATGGCCGCC TTAACCGCAG AGATTTGGAA AAATCACCCA GAAGAATACT TGGCAATGAG AGCTGATTGG GTGGATAAAT ACCCCAAAGC AACCAAAGCG TTACTCAAAG GCATTATGGA GGCGCAACAG TGGTTAGATA ATTTTGACAA CCGCAAAGAA GCAGCTCAAA TTCTGGCTGG AAGAAATTAT TTCAACCTCA ATAATCCAGA AATTCTGGCA GACCCATACG TCGGCAAATA TGATATGGGT GATGGTCGCA AAATTGATGA TAAATCAATG GCGGCTTACT ACTGGAAAGA TGAAAAAGGT AGTGTTTCTT ATCCCTACAA GAGTCATGAT TTGTGGTTCA TCACAGAAAA TGTACGTTGG GGATTCTTAC CCAAAGATTA CCTAGCTAAT GGTGCAGCTA AAGCCAAAGA ATTAATCGAT AAAGTCAACC GCGAAGATAT TTGGAAAGAA GCGGCTAAAG AGGCGGGAAT TGCTGCGGCT GATATTCCCA CAAGTACATC TCGCGGTGTT GAAGAATTCT TTGATGGCAC AAAATTTGAC CCCGAAAAGC CAGACGAATA TCTCAAGAGC CTGAAAATCA AGAAAGTTAG TGTTTAG
|
Protein sequence | MTEFFNQFSR RKFIVTAGAS AGAVFLKGCL GNPPETTGGT QSAPTAQPAA NVSAEQAPEV TTVKLGYIPI VESAPLIIAK EKGFFAKYGL TNVELSKQAS WGSARDNVEI GSAGGGIDGG QWQMPMPHLI TEGLITKGNQ KIPMYVLCQL ITHGNGIAIA NKHQGKGISL KLEGAKSLFS QLKSSTPFTA AFTFPHVNQD LWIRYWLAAG GIDPDADVKL LTVPAAQTVA NMKTGTMDAF STGDPWPFRL VNDKIGYMAA LTAEIWKNHP EEYLAMRADW VDKYPKATKA LLKGIMEAQQ WLDNFDNRKE AAQILAGRNY FNLNNPEILA DPYVGKYDMG DGRKIDDKSM AAYYWKDEKG SVSYPYKSHD LWFITENVRW GFLPKDYLAN GAAKAKELID KVNREDIWKE AAKEAGIAAA DIPTSTSRGV EEFFDGTKFD PEKPDEYLKS LKIKKVSV
|
| |