Gene Information |
Locus tag | Ava_4066 |
Symbol | |
ID | 3681687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5055470 |
End bp | 5056540 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637719417 |
Product | ABC transporter, substrate-binding protein, aliphatic sulphonates |
Protein accession | YP_324565 |
Protein GI | 75910269 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
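
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the annotated span includes the terminating stop codon (the gene sequence in this record ends in TAA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration; they are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose span includes the stop codon.

    Assumes the gene length is a whole number of codons; the final codon
    is the stop codon and does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Ava_4066 on NC_007413, + strand).
gene_len = gene_length_bp(5055470, 5056540)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1071 356
```

Under these assumptions the computed values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields listed in the table.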
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.559001 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGAGTC AGTTTGTATT GCCTCAACTA ATGAAGACGG TCAGCCGCTT TCTTCAAAAT TATGAGATGT CTAGAATACG TAAATTTATC TGCTGGTTTA CATTGGGGCT AAGTTTGAGT CTGGTTATTT CTGCCTGTTC TCCTAGCAAT ACAAACAACT CTGCGGTAAC GTCAACAGCA AGCCCCACCC CTACAACTCA AGGTGTTGCA ATTCGCATTG GTTATCAAAA AGCAGCCACA GTTCTCAATG CAATGCGAAG CAGGGGAGAA GTAGAAAAAG CCTTGACTGC TGCCGGTGCA ACGGTTACAT GGACAGAATT TCCCGCCGGG CCACCGATGC TAGAAGCGAT GAATGCAGGT AGTATCGACT TTGGTTATAC AGGAGAATCA CCCCCCATTT TTGCTCAAGC TGGTGGTGTT CCCCTATTGT ATGTAGCTTA CGACCCTTGG AGTCCCAAAG CCGAAGCAAT TATCGTACCG AAAAATTCAC CAATTAAAAC TGTCGCTGAA CTCAAGGGTA AAAGAGTCGC CTTTGCCAAA GGTTCTAATA CTAACTATTT AGTAGTCAAA GCCCTAGAAG CAGCCGGACT AAACTATAGT GACATCAAAC CCGCCTATCT CACCCCCGCA GATGCCCGCG CAGCTTTTGA AGGTGGTAAC GTTGATGCTT GGGCAATTTG GGACCCTTTT CTAGCAGCAG TTGAACAGGC TACAGGTGCA AGGATTCTCA CCGATGCCAC AAATTTAGCA CCCAATCGAG GTTACTATCT GGTGCGTCAA GCCTTTGTGA ATACTCATGG AGATGTATTG AAAACCCTGT TAGATGAAGT TACCAAAGTC GATAAATGGG CAGCCAATAA CCCCCAAGAA GTAGCTAAAT TTTTAGAACC AGAATTAGGC ATTCCCGCCG CCGCCTTGGA AGTTGCCGAG AAACGCCGAC AGTATGGGGT TTTCCCGTTA ACAGATGAAG TAATTAGCAA GCAGCAAGAT ATTGCCGATA CCTTTTACAA AATCCAACTA ATTCCCAAAC AAATTCAAGT AAAAGACATC GTTTGGCAAG GCAAGAAATA A
|
Protein sequence | MWSQFVLPQL MKTVSRFLQN YEMSRIRKFI CWFTLGLSLS LVISACSPSN TNNSAVTSTA SPTPTTQGVA IRIGYQKAAT VLNAMRSRGE VEKALTAAGA TVTWTEFPAG PPMLEAMNAG SIDFGYTGES PPIFAQAGGV PLLYVAYDPW SPKAEAIIVP KNSPIKTVAE LKGKRVAFAK GSNTNYLVVK ALEAAGLNYS DIKPAYLTPA DARAAFEGGN VDAWAIWDPF LAAVEQATGA RILTDATNLA PNRGYYLVRQ AFVNTHGDVL KTLLDEVTKV DKWAANNPQE VAKFLEPELG IPAAALEVAE KRRQYGVFPL TDEVISKQQD IADTFYKIQL IPKQIQVKDI VWQGKK