Gene Ava_4066 details


Gene Information

Locus tag: Ava_4066
Symbol:
ID: 3681687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5055470
End bp: 5056540
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 45%
IMG OID: 637719417
Product: ABC transporter, substrate-binding protein, aliphatic sulphonates
Protein accession: YP_324565
Protein GI: 75910269
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
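
The coordinate and length fields above are consistent with one another. The following is a minimal sketch of that check, assuming the start and end positions are inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 5055470
end_bp = 5056540

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1071 356
```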


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.559001
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGAGTC AGTTTGTATT GCCTCAACTA ATGAAGACGG TCAGCCGCTT TCTTCAAAAT 
TATGAGATGT CTAGAATACG TAAATTTATC TGCTGGTTTA CATTGGGGCT AAGTTTGAGT
CTGGTTATTT CTGCCTGTTC TCCTAGCAAT ACAAACAACT CTGCGGTAAC GTCAACAGCA
AGCCCCACCC CTACAACTCA AGGTGTTGCA ATTCGCATTG GTTATCAAAA AGCAGCCACA
GTTCTCAATG CAATGCGAAG CAGGGGAGAA GTAGAAAAAG CCTTGACTGC TGCCGGTGCA
ACGGTTACAT GGACAGAATT TCCCGCCGGG CCACCGATGC TAGAAGCGAT GAATGCAGGT
AGTATCGACT TTGGTTATAC AGGAGAATCA CCCCCCATTT TTGCTCAAGC TGGTGGTGTT
CCCCTATTGT ATGTAGCTTA CGACCCTTGG AGTCCCAAAG CCGAAGCAAT TATCGTACCG
AAAAATTCAC CAATTAAAAC TGTCGCTGAA CTCAAGGGTA AAAGAGTCGC CTTTGCCAAA
GGTTCTAATA CTAACTATTT AGTAGTCAAA GCCCTAGAAG CAGCCGGACT AAACTATAGT
GACATCAAAC CCGCCTATCT CACCCCCGCA GATGCCCGCG CAGCTTTTGA AGGTGGTAAC
GTTGATGCTT GGGCAATTTG GGACCCTTTT CTAGCAGCAG TTGAACAGGC TACAGGTGCA
AGGATTCTCA CCGATGCCAC AAATTTAGCA CCCAATCGAG GTTACTATCT GGTGCGTCAA
GCCTTTGTGA ATACTCATGG AGATGTATTG AAAACCCTGT TAGATGAAGT TACCAAAGTC
GATAAATGGG CAGCCAATAA CCCCCAAGAA GTAGCTAAAT TTTTAGAACC AGAATTAGGC
ATTCCCGCCG CCGCCTTGGA AGTTGCCGAG AAACGCCGAC AGTATGGGGT TTTCCCGTTA
ACAGATGAAG TAATTAGCAA GCAGCAAGAT ATTGCCGATA CCTTTTACAA AATCCAACTA
ATTCCCAAAC AAATTCAAGT AAAAGACATC GTTTGGCAAG GCAAGAAATA A
 
Protein sequence
MWSQFVLPQL MKTVSRFLQN YEMSRIRKFI CWFTLGLSLS LVISACSPSN TNNSAVTSTA 
SPTPTTQGVA IRIGYQKAAT VLNAMRSRGE VEKALTAAGA TVTWTEFPAG PPMLEAMNAG
SIDFGYTGES PPIFAQAGGV PLLYVAYDPW SPKAEAIIVP KNSPIKTVAE LKGKRVAFAK
GSNTNYLVVK ALEAAGLNYS DIKPAYLTPA DARAAFEGGN VDAWAIWDPF LAAVEQATGA
RILTDATNLA PNRGYYLVRQ AFVNTHGDVL KTLLDEVTKV DKWAANNPQE VAKFLEPELG
IPAAALEVAE KRRQYGVFPL TDEVISKQQD IADTFYKIQL IPKQIQVKDI VWQGKK
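
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: the names CODON_TABLE and translate are hypothetical, and it assumes that translation table 11 assigns amino acids the same way as the standard genetic code (the two differ in permitted start codons, which this sketch does not model). Input may use the space-separated 10-base blocks shown above.

```python
from itertools import product

# Standard genetic code in TCAG order. Assumption: table 11 uses the same
# codon-to-amino-acid assignments; alternative start codons are not handled.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_blocks: str) -> str:
    """Translate DNA (whitespace allowed) in frame 0, stopping at the first stop codon."""
    dna = "".join(dna_blocks.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTGGAGTC AGTTTGTATT GCCTCAACTA"))  # MWSQFVLPQL
# Last 21 bases of the gene sequence above, ending in the TAA stop codon.
print(translate("GTTTGGCAAG GCAAGAAATA A"))  # VWQGKK
```

Both outputs match the corresponding ends of the protein sequence listed above.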