Gene Ava_4714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4714 
Symbol 
ID3679733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5889386 
End bp5890531 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content42% 
IMG OID637720070 
Productthiosulphate-binding protein 
Protein accessionYP_325206 
Protein GI75910910 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.217544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.582939 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAGT GGCAGCGTCC CCTGAAAAAA TTGTGGTTGC TAGCAGAGCA GAGGACATAT 
AGGTTCAGAT TTAACTCCCT GAAAAGCTTT GTGTCACTGG TATTAGTTGG GGCTGTTTTG
AGTGTGGCGC TTGCAGCCTG CACTGGTGGT AGTGAAAACA ATACTTCCAC CGCAAACCCC
GTTGCTAGTC CTGTTGCAGC CAACAAACCA AACGTGGAAT TAACTCTAGT ATCCTTTGCT
GTTACCAAGG CAGCACATGA GGCAATTATT CCTAAATTTG TAGAAAAATG GAAGCAAGAA
CATAACCAAA CTGTCACATT TAAACAGAGT TATGGCGGTT CTGGTTCTCA AACCCGTGCC
GTTATCGATG GTTTAGAAGC AGATGTTGTC CACTTAGCAC TGGCTTTAGA CACGCAAAAA
ATTGAGAAAG CAGGGTTAAT TCAACCAGGA TGGGAAAAAG AACTTCCTAA TGATGGAATT
GTCTCCAAAT CTGTAGCAGC AATCATTACC CGTCCAGGCA ACCCCAAAGG TATCAAAACA
TGGGCAGATT TGGCTAAGGA CGATGTTAAA GTAATTACTG CTGATCCTAA AACCTCAGGT
ATTGCTCGCT GGAACTTCTT AGCTTTGTGG AACTCCGTGA TCAAAACTGG TGGTGATGAA
GCTAAGGCAA CAGAGTTTGT GACTAAGGTT TATGGAAACG TGCCAATTTT AACCAAGGAT
GCGCGGGAAG CAACTGACGC ATTTGCCAAA CAAGGCCAAG GAGACGCTCT GATTAATTAC
GAAAATGAAG TCATTTTGGC ACAGCAAAAA GGTGAAAAGT TGGATTATGT AATTCCTAGT
GTCAATATTT CTATTGATAA TCCTATCGCT GTCGTAGATC AGAATGTTGA TAAGCATGGT
AATCGAGAAG TAGCCGAAGG ATTTGTCAAA TTCTTATATA CCCCAGAAGC ACAAGAAGAG
TTTGTGAAAT TAGGTTTTCG ACCAGTAGAT GAGAAAGTTG CTCAAACGAA AGAAGTAACA
GATAAGTTTC CCAAAGTAGA TACTCTAGGT ACTGTTCAAG ACTTAGGAGG CTGGGCAACA
ATTGACAAAA AATTCTTTGC TGATGGTGGT GTTTTTGACC AAATTCAAGC CAAAAACAAG
CGGTAA
 
Protein sequence
MSEWQRPLKK LWLLAEQRTY RFRFNSLKSF VSLVLVGAVL SVALAACTGG SENNTSTANP 
VASPVAANKP NVELTLVSFA VTKAAHEAII PKFVEKWKQE HNQTVTFKQS YGGSGSQTRA
VIDGLEADVV HLALALDTQK IEKAGLIQPG WEKELPNDGI VSKSVAAIIT RPGNPKGIKT
WADLAKDDVK VITADPKTSG IARWNFLALW NSVIKTGGDE AKATEFVTKV YGNVPILTKD
AREATDAFAK QGQGDALINY ENEVILAQQK GEKLDYVIPS VNISIDNPIA VVDQNVDKHG
NREVAEGFVK FLYTPEAQEE FVKLGFRPVD EKVAQTKEVT DKFPKVDTLG TVQDLGGWAT
IDKKFFADGG VFDQIQAKNK R