Gene Ava_5002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_5002 
Symbol 
ID3679054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6287435 
End bp6288478 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content42% 
IMG OID637720362 
ProductABC transporter, substrate-binding protein, aliphatic sulphonates 
Protein accessionYP_325494 
Protein GI75911198 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.016963 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.417754 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCAAC CATTTGAACC ACAAAGACGA AAAGTTTTAC AAGGATTATT AACATATTCA 
GCCTTAGGAT TATCTGGTTT TGCTGCCCCG ATAATTGGAG ACATATTACA AGCCAATGCT
CAGACAAAAC GGTCAGGAGT TAATACTAAA ACAGTTCGAG TGGGTTATCA AACTTCTGGA
GATTTAGTCA AAATCAAAAA AGTTTTAGAA CCACGACTCA AATCTATCGG TGTGGATATA
GAATGGACAC CATTTCCGGC GGGACCCCAA CTGTTAGAAG CTATGAATGT AGGTAGGGTT
GATATTGGTT CAGTGGGAGA AACACCACCA ATATTTGCTC AAGCAGCAGG TGCGCCACTG
GTGTATATCG CTGGACGCAA ACCTAGCAAG GGTGAAGGTA GTGCGATCAT TGTCAGAAAA
AACTCACCTA TTAAGAGAGT TGCAGACCTC AAAGGTCAGA AAGTAGTTTT TCAAAAAGGA
TCAGCATCAC ACTATCTACT TGTCAGTGCT TTAAAAGAAG CAGGTTTGAA ATTTAAGGAT
ATTCAAGCCA TCAGTTTAGC ACCTGCGGAA GCTCGTGATG CCTTCCTGCA AGAAAAAATA
GATGCGTGGG TAACATGGGA CCCCTTCTAT GCTTTTGTGC AGAAAAATGC TGGCGCTCGC
ACTTTAAGAA ATGCGGCTGG AATTGCTACT CAAGGTGGGT TTTATCTATC ACGGCGCGAA
TTTGCTGTCC AAAATCCTGA AGTGGTGAAG GTAATTTTAG ATGAGATAGA CAAACTAGGA
CGATGGGCAG AAAGTAACCC TAAAGAAGTA GTGAAAATTC TGGCTCCTGA ACTGAAGCTG
GAACCGTCAC TTTTAGAAAC TGTGGTGCGC CGACGCACTT ATGGATTAAG AAGACTTACT
CCTTCTCTAG TTGCTGAACA ACAGCGCATT GCAGATTTAT TTTACGCAGA GAAGATTATA
CCGAAGAAAA TTGATATTAG ACAGGCATTA CTTACTTCTC AACAATATGC AGCGATCACG
CCTCAGCGTA TTAGTGGCCG ATAG
 
Protein sequence
MHQPFEPQRR KVLQGLLTYS ALGLSGFAAP IIGDILQANA QTKRSGVNTK TVRVGYQTSG 
DLVKIKKVLE PRLKSIGVDI EWTPFPAGPQ LLEAMNVGRV DIGSVGETPP IFAQAAGAPL
VYIAGRKPSK GEGSAIIVRK NSPIKRVADL KGQKVVFQKG SASHYLLVSA LKEAGLKFKD
IQAISLAPAE ARDAFLQEKI DAWVTWDPFY AFVQKNAGAR TLRNAAGIAT QGGFYLSRRE
FAVQNPEVVK VILDEIDKLG RWAESNPKEV VKILAPELKL EPSLLETVVR RRTYGLRRLT
PSLVAEQQRI ADLFYAEKII PKKIDIRQAL LTSQQYAAIT PQRISGR