Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4240 |
Symbol | |
ID | 3680946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5318542 |
End bp | 5319915 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637719588 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_324734 |
Protein GI | 75910438 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.615359 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACAGT ACATACATGG CGGTGTGAAT CGTCGTAAGT TCTTAGGTAT GACTGCTGCT GGTACTCTTA TGGCCACAGC TAGTGCCAAT TTATTCTCAA GAGCGACAGC CCAATCTAGT CGCCCAAATG TGGTGTTTAT TTTAGTTGAT GACATGGGTT GGGGCGACCT GAGCATCTAT GGACGCACAG ATTACGAAAC TCCTAATCTA GACAGACTGG CACGGCAGGG AGTACGTTTC ACGAATGCTT ACGCGAATCA AACCGTTTGT ACTCCTACAC GGATAGCTTT CTTAACGGGA CGATATCAAG CGCGATTACC CGTCGGCTTA CGAGAACCTC TAGGCGCGCG CTCACAACCA GCTAGTAATA ACATAGGAAT ACCAGCCAAT CAACCCACCA TAGCCTCACT ACTGAAAGCA AATGGTTATG AAACTGCGTT GGTTGGTAAG TGGCACGCTG GTTATCCCCC TAACTTTGGG CCTCTCCAAA AGGGCTTTGA CGAGTACTTT GGACACTTAA GCGGTGGAAT TGAATATTTC ACGCATACAG GTACAGATCG GATACTGGAT CTCTATGAAA ATGATGTACC TGTACAGCGT TCTGGGTATG TTACAGATTT GTTTACAGAC AGAGCAGTTG AATTCATCCA ACGTCCACAC TCTCGCCCAT TTTATCTAAG TTTGCACTAC AATGCGCCCC ATTGGCCTTG GCAGGGGCCA AATGATCAAG CATCAACTGC TTTTTATCTG ACTAATGGTT ATACAGTAGG TGGTTCACAA GCAACCTATG CTGCAATGGT CAAGAGTTTG GATGACGGAG TTGGCAGAGT ATTAGACGCA CTGGAAGCAA GCGGACAAGC TGATAATACC TTGGTAATTT TTACCAGTGA TAATGGTGGC GAAAGATTCT CTAACTTTGG GCCATTCCGG GGGCAAAAGG CTAGTTTATA TGAAGGTGGT ATACGAGTAC CTGCCATCAT TCGCTATCCA GGTGTGACTC AAGCTAATCA AGTGAGCAAT CAGGTGATTA TCACTTTTGA TTTAACTGCA ACTATTCTTG CTGCCACTGG CACAAGTTTC CATCCCAACT ATCCACCAGA TGGTCAAAAT TTACTTCCCT TACTACGTGG CGATCGCAGT GAGTTTTCCC GCACCTTGTT TTGGCGTTAT GGGGCGGCGT TAACAACAAG GCAAAGAGCT GTGCGAAGCG GTGACTGGAA GTATTGGAGA CGAGGAAACC AAGAAGCTTT GTTTAACTTA GCAACTGATC CAGGCGAAAC AACAGACCTC AAGGATAGTA ATGCACAGGT ATTTACACGA CTACGCAACC AATTCCAACA TTGGGAATTA CAAATGTTGC CTTATGGATC TTAA
|
Protein sequence | MTQYIHGGVN RRKFLGMTAA GTLMATASAN LFSRATAQSS RPNVVFILVD DMGWGDLSIY GRTDYETPNL DRLARQGVRF TNAYANQTVC TPTRIAFLTG RYQARLPVGL REPLGARSQP ASNNIGIPAN QPTIASLLKA NGYETALVGK WHAGYPPNFG PLQKGFDEYF GHLSGGIEYF THTGTDRILD LYENDVPVQR SGYVTDLFTD RAVEFIQRPH SRPFYLSLHY NAPHWPWQGP NDQASTAFYL TNGYTVGGSQ ATYAAMVKSL DDGVGRVLDA LEASGQADNT LVIFTSDNGG ERFSNFGPFR GQKASLYEGG IRVPAIIRYP GVTQANQVSN QVIITFDLTA TILAATGTSF HPNYPPDGQN LLPLLRGDRS EFSRTLFWRY GAALTTRQRA VRSGDWKYWR RGNQEALFNL ATDPGETTDL KDSNAQVFTR LRNQFQHWEL QMLPYGS
|
| |