Gene Ava_4240 details


Gene Information

Locus tag: Ava_4240
Symbol:
ID: 3680946
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5318542
End bp: 5319915
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 46%
IMG OID: 637719588
Product: twin-arginine translocation pathway signal
Protein accession: YP_324734
Protein GI: 75910438
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
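
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are illustrative and are not part of any database API.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the record above.
span = cds_span(5318542, 5319915)
residues = expected_protein_length(span)
print(span, residues)
```

Under these assumptions the span agrees with the listed gene length of 1374 bp, and the residue count agrees with the listed protein length of 457 aa.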


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.615359
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACAGT ACATACATGG CGGTGTGAAT CGTCGTAAGT TCTTAGGTAT GACTGCTGCT 
GGTACTCTTA TGGCCACAGC TAGTGCCAAT TTATTCTCAA GAGCGACAGC CCAATCTAGT
CGCCCAAATG TGGTGTTTAT TTTAGTTGAT GACATGGGTT GGGGCGACCT GAGCATCTAT
GGACGCACAG ATTACGAAAC TCCTAATCTA GACAGACTGG CACGGCAGGG AGTACGTTTC
ACGAATGCTT ACGCGAATCA AACCGTTTGT ACTCCTACAC GGATAGCTTT CTTAACGGGA
CGATATCAAG CGCGATTACC CGTCGGCTTA CGAGAACCTC TAGGCGCGCG CTCACAACCA
GCTAGTAATA ACATAGGAAT ACCAGCCAAT CAACCCACCA TAGCCTCACT ACTGAAAGCA
AATGGTTATG AAACTGCGTT GGTTGGTAAG TGGCACGCTG GTTATCCCCC TAACTTTGGG
CCTCTCCAAA AGGGCTTTGA CGAGTACTTT GGACACTTAA GCGGTGGAAT TGAATATTTC
ACGCATACAG GTACAGATCG GATACTGGAT CTCTATGAAA ATGATGTACC TGTACAGCGT
TCTGGGTATG TTACAGATTT GTTTACAGAC AGAGCAGTTG AATTCATCCA ACGTCCACAC
TCTCGCCCAT TTTATCTAAG TTTGCACTAC AATGCGCCCC ATTGGCCTTG GCAGGGGCCA
AATGATCAAG CATCAACTGC TTTTTATCTG ACTAATGGTT ATACAGTAGG TGGTTCACAA
GCAACCTATG CTGCAATGGT CAAGAGTTTG GATGACGGAG TTGGCAGAGT ATTAGACGCA
CTGGAAGCAA GCGGACAAGC TGATAATACC TTGGTAATTT TTACCAGTGA TAATGGTGGC
GAAAGATTCT CTAACTTTGG GCCATTCCGG GGGCAAAAGG CTAGTTTATA TGAAGGTGGT
ATACGAGTAC CTGCCATCAT TCGCTATCCA GGTGTGACTC AAGCTAATCA AGTGAGCAAT
CAGGTGATTA TCACTTTTGA TTTAACTGCA ACTATTCTTG CTGCCACTGG CACAAGTTTC
CATCCCAACT ATCCACCAGA TGGTCAAAAT TTACTTCCCT TACTACGTGG CGATCGCAGT
GAGTTTTCCC GCACCTTGTT TTGGCGTTAT GGGGCGGCGT TAACAACAAG GCAAAGAGCT
GTGCGAAGCG GTGACTGGAA GTATTGGAGA CGAGGAAACC AAGAAGCTTT GTTTAACTTA
GCAACTGATC CAGGCGAAAC AACAGACCTC AAGGATAGTA ATGCACAGGT ATTTACACGA
CTACGCAACC AATTCCAACA TTGGGAATTA CAAATGTTGC CTTATGGATC TTAA
 
Protein sequence
MTQYIHGGVN RRKFLGMTAA GTLMATASAN LFSRATAQSS RPNVVFILVD DMGWGDLSIY 
GRTDYETPNL DRLARQGVRF TNAYANQTVC TPTRIAFLTG RYQARLPVGL REPLGARSQP
ASNNIGIPAN QPTIASLLKA NGYETALVGK WHAGYPPNFG PLQKGFDEYF GHLSGGIEYF
THTGTDRILD LYENDVPVQR SGYVTDLFTD RAVEFIQRPH SRPFYLSLHY NAPHWPWQGP
NDQASTAFYL TNGYTVGGSQ ATYAAMVKSL DDGVGRVLDA LEASGQADNT LVIFTSDNGG
ERFSNFGPFR GQKASLYEGG IRVPAIIRYP GVTQANQVSN QVIITFDLTA TILAATGTSF
HPNYPPDGQN LLPLLRGDRS EFSRTLFWRY GAALTTRQRA VRSGDWKYWR RGNQEALFNL
ATDPGETTDL KDSNAQVFTR LRNQFQHWEL QMLPYGS
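
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in the permitted start codons), and it does not handle alternative start codons such as GTG or TTG, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACACAGT ACATACATGG CGGTGTGAAT"))
```

Applied to the first 30 bases, the sketch yields MTQYIHGGVN, which matches the first block of the protein sequence above. Applied to the full 1374 bp sequence, it should stop at the terminal TAA codon.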