Gene Ava_5003 details


Gene Information

Locus tag: Ava_5003
Symbol:
ID: 3679055
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 6288550
End bp: 6289593
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 41%
IMG OID: 637720363
Product: ABC transporter, substrate-binding protein, aliphatic sulphonates
Protein accession: YP_325495
Protein GI: 75911199
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
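
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the numbers shown but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6288550
END_BP = 6289593
GENE_LENGTH_BP = 1044
PROTEIN_LENGTH_AA = 347


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def encoded_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = encoded_protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)        # coordinates agree with Gene Length
print(computed_protein_length == PROTEIN_LENGTH_AA)  # Gene Length agrees with Protein Length
```

Under these assumptions, 6289593 - 6288550 + 1 gives 1044 bp, and 1044 / 3 - 1 gives 347 aa, matching the listed values.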


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.00833657
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.452376
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATAC CATTTCTTGA TCAACGGCGT AAATTCTTTA AACAGCTTGT ACAATATTCA 
GCATTAGGAT TATTTACTTT CTCAACACCC CTAGTAGGTA GCTTATTACA GAGTACTCAA
GCCCAAACCC AACGAGGATT TGTTGCCAAA GAAATCAACT TAGCTTATCA AACTTCTGGT
GATATTGTTA AAGTCCGAAA AGTTGTAGAG CCACGGTTTA AAGCTTTGGG TATCAAAGTG
AATTGGGTAG GGCCATTCCC AGCCGGGCCG CAATTGATAG AAGCAATGAA CGCAGGTAGA
GTTGATATTG GTACTGTGGG AGAAACACCG CCAATATTTT CCCAAGCCGC AGGCATCACA
GAAGTTATCT ACATTGCTGG ACGCACTCCT AGTCGGGGTC AAAACCAAGG TGTTGTAGTC
AGAGCTGATT CTCCGATTAA AAAATTAGCT GATATTAAAG GGAAAAAAGT TGCTTTTCAG
AGGGGATCAA ATGCACATTA TTTACTAGCA AAAGCTTTAC AAGAAGTGGG CTTAAAAATT
AGTGATGTGC AAATTGTTGG TTTAACTCCA TCAGAAGCTC GTGATGCTTT CATTCAAAAC
AAAGTGGATG TTTGGGTAGC TAGTGACCCA TTCTTAGCTC TGGTGGAAAA AATTATTCCG
ATTCGCAATT TGAGGAATGC AGCCAAAATT AACACATTGG GTGGATTTTA CCTGGGTAGA
CGTAGATTTG TCACTCAAAA TCCTGAATTG GTAAGGGTGT TTTTAGAAGA AGCAGACAAG
GTAGGCGAAT GGGCAGAAAA AAATCCCACT GAAGTTGCTA AGGCTTTTGC ACCGGAACTG
AAATTGGAAG TATCAGTTTT AGAAAAGGTA GCACGCCGAC GTACTTATCG CTTAAGAAGA
CTCTCACCTG CGATTATTGC TGAACAACAA CGGGTAGCTG ATTTTTACTT TCAAGAAAAA
ATAATTCCCC GCAAAATAAA CATTCAGGAT GCACTACTAC CGCCACAATT ATCGGCAGCA
ATTACACCCA AGCGTCTGAA ATGA
 
Protein sequence
MKIPFLDQRR KFFKQLVQYS ALGLFTFSTP LVGSLLQSTQ AQTQRGFVAK EINLAYQTSG 
DIVKVRKVVE PRFKALGIKV NWVGPFPAGP QLIEAMNAGR VDIGTVGETP PIFSQAAGIT
EVIYIAGRTP SRGQNQGVVV RADSPIKKLA DIKGKKVAFQ RGSNAHYLLA KALQEVGLKI
SDVQIVGLTP SEARDAFIQN KVDVWVASDP FLALVEKIIP IRNLRNAAKI NTLGGFYLGR
RRFVTQNPEL VRVFLEEADK VGEWAEKNPT EVAKAFAPEL KLEVSVLEKV ARRRTYRLRR
LSPAIIAEQQ RVADFYFQEK IIPRKINIQD ALLPPQLSAA ITPKRLK
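
The sequences above are laid out in space-separated groups of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of that cleanup together with a GC-fraction helper and a basic CDS sanity check. The helper names are hypothetical, and the check only tests for an ATG start and the three stop codons of translation table 11 (TAA, TAG, TGA); alternative start codons permitted by that table are not handled.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, recognised stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence block above, as printed.
first_line = "ATGAAAATAC CATTTCTTGA TCAACGGCGT AAATTCTTTA AACAGCTTGT ACAATATTCA"
last_line = "ATTACACCCA AGCGTCTGAA ATGA"

print(len(clean_sequence(first_line)))  # bases on a full line
print(clean_sequence(last_line)[-3:])   # final codon of the gene
```

To check the whole record, pass the full gene sequence block through clean_sequence and compare its length and gc_fraction with the Gene Length and GC content fields.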