Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4176 |
Symbol | |
ID | 3681037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5225621 |
End bp | 5226622 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637719523 |
Product | hypothetical protein |
Protein accession | YP_324670 |
Protein GI | 75910374 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000217029 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGAACTG TTTACGTTAC ACAGGATGAT TCATTTATTA GCAAAATTGA TGAACGTCTG AATGTTAAAT TTGAGAAGAA GGTAATTTTA GACGTACCTT TAATTAAAAT TGATGGGTTA GTTGTGATGG GAAGGGCTAG TATTTCCCCT GCGGCTATTT TTGAACTCAT TGATAAGAAA ATTCCCCTCA CATTTCTCAC CAATAACGGT AAATATCTTG CTAGTTTAGA GCCAGAAATG GGTAAAAATA TTTTCGTCCG TGCTGCTCAA TGGAAAGCTG CTGGAGAATC AGACCAAGCA ATTCATGTTA CCCAAGGTTT TGTTAGAGGT AAACTGAAAA ATTACCGCCA TAGTTTACTA GAAGCACAGC GCAGATATGA GGTTGATTTA AATAGTAATA TCAGTCAATT ATCTCATGCG ATCACCTCCA TCGACAAAGC CAATTCAATT AATACAATCA GGGGTTTAGA AGGTGCTGGT AGTGCTGCCT ATTTTGGCTG CTTTAATCAA CTAATCCGAG TTGATAATTT CAGCTTTCAT ACCCGCAACC GTCGCCCACC TATAGACCCG GTAAATTCAC TCTTGAGTTT AGGATATTCC TTACTACGTC ACGATATTCA AGGAGCATTA AATATCGTCG GTTTCGACCC CTATTTAGGA TACCTACATA CAGAAAGATA TGGTAGACCT TCTCTAGCAT TAGATTTAAT GGAAGAGTTT CGACCCTTAA TAGTCGATGC TGTTGTTCTC ACAGCAATTA ATCGCCGGAT GCTTTCACCA AAAGACTTTG TTACTGAACC TATAAGTAAC GCTGTTTCAC TGACTAAAGA AGGTTTACAT ATTTTCTTGA GATTATATCA AGAAAAGAAA CAAACCCAAT TCAAACATCC TGTTATGCAG AAAAAGTATA CCTATCAAGA AACTTTTGAA ATTCAAGCTA GACTCCTAGC TAAATACCTC ATGGGAGAAC TAGATAAATA CCCCCCTTTG GTCATGAGAT AG
|
Protein sequence | MGTVYVTQDD SFISKIDERL NVKFEKKVIL DVPLIKIDGL VVMGRASISP AAIFELIDKK IPLTFLTNNG KYLASLEPEM GKNIFVRAAQ WKAAGESDQA IHVTQGFVRG KLKNYRHSLL EAQRRYEVDL NSNISQLSHA ITSIDKANSI NTIRGLEGAG SAAYFGCFNQ LIRVDNFSFH TRNRRPPIDP VNSLLSLGYS LLRHDIQGAL NIVGFDPYLG YLHTERYGRP SLALDLMEEF RPLIVDAVVL TAINRRMLSP KDFVTEPISN AVSLTKEGLH IFLRLYQEKK QTQFKHPVMQ KKYTYQETFE IQARLLAKYL MGELDKYPPL VMR
|
| |