Gene Information |
Locus tag | Ava_5004 |
Symbol | |
ID | 3679056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 6289793 |
End bp | 6291613 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637720364 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_325496 |
Protein GI | 75911200 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
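The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both are assumptions rather than statements from the record, although the listed values agree with them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 6289793
end_bp = 6291613

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the final stop codon
# not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1821
print(protein_length)  # 606
```

Both results match the Gene Length (1821 bp) and Protein Length (606 aa) fields, and the gene length is a multiple of three, as expected for an unspliced CDS.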
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00831315 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.753431 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACCCG ATCACGATTT TGATCAAGCT TCTCAAGAGC AGAATGAAAC AACAAATCCT CAATCTCGTA GTAGTGGAAA AAGACGTGGT AGACTTTTTG GGCGAGACGG TGTTAGTAGA CGTTTATTTC TTGGTCACGC TGGTTTGTTT ACTGCTACTA GCGTTGTGAC TGGGATGATC GGTTCGTCTT TCTCAGGTTC CAAACAGGGA GATATTGCAC AAGCTAGAGA AATTGCTCAG GGTAGATATC GTGTTCGAGA TTTTAACTAT ATAAAATTTC GTCAGCAAGC CTTTCAAGTC CGTCTCCAAG CAGCGCAAAA TAATCGGCAA ATTGATATCC CACCCCATCC TACCAATGGC GATGAGGAAC GCTATGCCAA CAAAATTGCT ACAGACACGA GGGGATTACC ACACGATCAA AGAGGCGAAG TTGATCTGCA AGCTTATGAT TCTTTGATTA GGGCATTAAC AACACAGAAC CCAGATGATT ACGAAAGGAT TATTTTGGGT GGTACAAGGA AGTTAGTGAA TCCTCAAGGG CCGTTGGCAA TTAGTTTAGA AGGCATCAAT GCTTCTCAGA TAGCAGTTCC ACCGCCACCA ACCCTAAATA GTGCAGAACA AGCGGCAGAG GCGATTGAAC TCTACTGGCA GGCTTTATTA AGGGATGTGC CTTTCAGTCG ATTTGCTCAA GATCCGAATG TCGCCGCAGC GATCGCCGAA CTTAATAGCC TACCGGAGTT CCGGGGGCCA AAACAGAACG GTTTTGTCAC TCCGCAAACT TTATTTCGTG GTAGCGTTAT CTATGTTGAT CAGCGCGATC GCTCAGGTAG GACAACAAAA TACGTTACTC CTCCAGGAGT GCTAGATGGC CCCCACATTT CCCAATTTCT GTTACGGGAA ATTCCTTACA ATACTCAATT CATTTCACCC TTGATTCGTA CTGCTCTAGC TGGTAAGGAG AATGATTTTC TCACTAATTA CAATGAATGG CTAACTGTCC AGAATGGAGG TAGTTCTGGC AAGTCCATTA AGTTCGATCC CACACGCCGT TTTGTTTTTA CTGTTCGTGA TCTGAGCGAA CTATCACGTA TTGGTGGGGC TTTGTTCTTT GGAGCTTTGT TAATTCTCAA TAGCATTAGC GCACCGTTAA ATCCAGGAAA TCCCTATATC AACTCCAAAA CTCAAGTTGG TTCTGCTGCT ACCTTTGCAT CGGCACATTT TCAAGCCTTA CTCAACTTAG CTCCCTCGCG GGCAATCAGA GCTTCTTATT GGCAAAAGTT TTACGTACAT CGACGTTTAC GACCAGAAGC TTATGGTGGA TTGGTTTATA ACAACATTGT CAATGGAACT AGTTATCCGA TTAATTCCCA AGTCTTCAAT TCCACAGCCT TAGCTCGCAC CTTCAGCACT TTTGGTACTT ATTTGTTACC CCATGCGTAC CCAGAAGGCG CACCATTCCA CTCTTCTTAC ACTGGTGGTG CTGCTTCGAT TGCGGGTGTG CAAGCTACGT TGTTGAAAGC CTTTTTTAAT GAGAATTTTG TGATTCCCAA TCCTGTAGAA CCTGATCCCA ATGACCCCAC CAAACTAATT CCCTACAGTG GCCCAGCTTT GACAGTCGGT GGTGAGTTGA ATAAACTGGC GACGAACTAT TACATCGGTC GTGGTCATGG CGGTATTCAT TGGCGTTCCG ATGGTGCAGC TGGCTTAGCG TTAGGTGAGG AAGTTGCTAT CAGTATTCTT AGAGATGAAA GACTGGGATA CAACGAACGG TTTAATGGTT TTACCTTCAC CAAATTTGAC 
GGTACAAGAG TGACTGTCTA A
|
Protein sequence | MQPDHDFDQA SQEQNETTNP QSRSSGKRRG RLFGRDGVSR RLFLGHAGLF TATSVVTGMI GSSFSGSKQG DIAQAREIAQ GRYRVRDFNY IKFRQQAFQV RLQAAQNNRQ IDIPPHPTNG DEERYANKIA TDTRGLPHDQ RGEVDLQAYD SLIRALTTQN PDDYERIILG GTRKLVNPQG PLAISLEGIN ASQIAVPPPP TLNSAEQAAE AIELYWQALL RDVPFSRFAQ DPNVAAAIAE LNSLPEFRGP KQNGFVTPQT LFRGSVIYVD QRDRSGRTTK YVTPPGVLDG PHISQFLLRE IPYNTQFISP LIRTALAGKE NDFLTNYNEW LTVQNGGSSG KSIKFDPTRR FVFTVRDLSE LSRIGGALFF GALLILNSIS APLNPGNPYI NSKTQVGSAA TFASAHFQAL LNLAPSRAIR ASYWQKFYVH RRLRPEAYGG LVYNNIVNGT SYPINSQVFN STALARTFST FGTYLLPHAY PEGAPFHSSY TGGAASIAGV QATLLKAFFN ENFVIPNPVE PDPNDPTKLI PYSGPALTVG GELNKLATNY YIGRGHGGIH WRSDGAAGLA LGEEVAISIL RDERLGYNER FNGFTFTKFD GTRVTV
|
| |
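The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds a codon table by hand instead of relying on a bioinformatics library. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so the standard amino-acid string is used here. The function name `translate` and the two fragment variables are hypothetical names introduced for the example. The fragments are the first 30 and the last 21 bases of the gene sequence listed above.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino acids of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # The record prints the sequence in blocks of ten, so drop whitespace first.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence in this record.
first_fragment = "ATGCAACCCG ATCACGATTT TGATCAAGCT"
last_fragment = "GGTACAAGAG TGACTGTCTA A"

print(translate(first_fragment))  # MQPDHDFDQA
print(translate(last_fragment))   # GTRVTV
```

The two outputs correspond to the first ten and last six residues of the protein sequence above. The final codon, TAA, is a stop and is not translated.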