Gene Ava_5012 details


Gene Information

Locus tag: Ava_5012
Symbol:
ID: 3679025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 6300326
End bp: 6301435
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 39%
IMG OID: 637720372
Product: ABC transporter, substrate-binding protein, aliphatic sulphonates
Protein accession: YP_325504
Protein GI: 75911208
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
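
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6300326
END_BP = 6301435
GENE_LENGTH_BP = 1110
PROTEIN_LENGTH_AA = 369


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1110
print(expected_protein_length(GENE_LENGTH_BP))  # 369
```

Both results agree with the Gene Length and Protein Length fields reported for Ava_5012.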


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0114898
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.178041
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCAAT TAATAAAAGA TTTTGCAAAT ATATCTCAAG CGTGGACAAG TAAACGCACT 
ACTCGTCGTC ACGCCTTATT TGCCTTTGGC TGTAGCCTAG TATTGTCTAC CACACTATTT
AGTTGCAGTC CTTCACAAAA TAATAACCAG CAACAAGCAT CTTCTTCCGC ATCTAATGTA
GCAAATACAA ATGCTACTAA TAAAGTGGTG AGGATTGTCC GTTCCAAACA ATTGACAGCT
TTAGCGGTTC TAGAACAAAA ACGTATTCTA GAAGAGCGAC TAAAACCTTT AGGTTACAAA
GTAGAATGGC CTGAATTTGC TGCTGGCCCT CAGCAATTAG AAGCTTTGAA TACAGGCGCA
CTGGATATTG CTTCAACTGC TGAATCACCT CCTATCTTTT CCCAAGCAGC AGGGACACCT
CTAGTTTATT TAGCTGCTAA TTCTTCTGAT GGTAAAGCAG TGTCGCTATT AGTTCCTGCT
AACTCTAATG TTAAAAGTGT TAAAGACTTA AAAGGCAAGA AAATTGCTTC TCAAAAAGCA
TCTATCGGTC ACTATCTTAT AGTCAGAGCT GTAGAAAGAG AAGGTTTAAA ACTGAGTGAT
ATACAGCCAG TTTATCTACC ACCTCCGGAC GCAAATGTGG CATTTAGCCA AGGTAAAGTG
GATGCTTGGT TTATTTGGGA ACCATTTGTG ACTAGAAATG TACAACAGAA GGTTGGCAGA
GTTTTAACAG ATGGTGGTAA TGGTTTACGG GATACTAACA ACTATGTCTC TACAACCCGT
AAGTTTTATC AAGAAAATCC AGAGTTAATC AAAATATTTC TGGAAGAACT GCAAAAAGCC
CAAAATTGGG CAAAAAATAA CCCCAAAGAA CTGGCTAACT TACTTGCTCA AACTACTCAA
CTTGACCCGC CTACATTAGA AATTATGCAC AGTAAGTATG ATTTCACACT CATACCAATT
ACTGAACAAA TTATTAACAA ACAGCAGGAA GTTGCTGACA AATGGTACCG TTTAGGGCTG
ATACCAAGAA AGGTGAATGT CAGAGATGGC TTTTTAACTC CAGAACAATA TGCGGAAATT
ACTCCCCAGG AAGTGCTGGC AAAAAAATAG
 
Protein sequence
MSQLIKDFAN ISQAWTSKRT TRRHALFAFG CSLVLSTTLF SCSPSQNNNQ QQASSSASNV 
ANTNATNKVV RIVRSKQLTA LAVLEQKRIL EERLKPLGYK VEWPEFAAGP QQLEALNTGA
LDIASTAESP PIFSQAAGTP LVYLAANSSD GKAVSLLVPA NSNVKSVKDL KGKKIASQKA
SIGHYLIVRA VEREGLKLSD IQPVYLPPPD ANVAFSQGKV DAWFIWEPFV TRNVQQKVGR
VLTDGGNGLR DTNNYVSTTR KFYQENPELI KIFLEELQKA QNWAKNNPKE LANLLAQTTQ
LDPPTLEIMH SKYDFTLIPI TEQIINKQQE VADKWYRLGL IPRKVNVRDG FLTPEQYAEI
TPQEVLAKK
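
The sequences above are laid out in space-separated groups of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up plus a GC-fraction helper. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the short input is only the final line of the gene sequence, used here for illustration.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block; upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Final line of the gene sequence shown above.
tail = clean_sequence("ACTCCCCAGG AAGTGCTGGC AAAAAAATAG")
print(len(tail))   # 30
print(tail[-3:])   # TAG, the stop codon that ends the CDS
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field of the record.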