Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0914 |
Symbol | |
ID | 6377018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1168077 |
End bp | 1170818 |
Gene Length | 2742 bp |
Protein Length | 913 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 642682048 |
Product | hypothetical protein |
Protein accession | YP_001958009 |
Protein GI | 189502292 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1074] ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.230756 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAGC TATATATATA TAGATCTGCT GCTGGTTCTG GAAAGACGTA TGTATTAGTG AAGGCTTATT TACAATTAGC GTTGAGAGCT CCCTTATACT TCCAAAGAAT TCTAGCAGTA ACTTTTACCA ATCGTGCTAC GCAGGAGATG AAGCAACGTA TTCTTAACAG CTTGCATGAC ATAGCACAAG GTAAGGAAAG CCTTCTTACT CAAGAATTAA ACCAAGCAAA TGGATGGGAT AGCAAAGAAT TACAAAAACG TGCCCAAGCG GTACTCTCTA AGGTCTTACA TAATTATGAC CATTTTAGCG TAGGCACTAT TGATAGTTTT TTACAAAGTA TTGTTCGGAA CTTCTCAAAG GAGCTTGGCA TCCAACATGG ATTTACCATA GAGATGGACC AAGAAACTAT ACTGAACTAT ATAATAGACG ATGTAATTAA TACAGCTAAC CAAGACAAAC AGCTACATCA ATGGCTAGTT AATTTTGCTG AAAATAAACT CTTGGCTGGC AAATCCTGGC ATTTTAAACA AGCACTTAAA CAGCTAGGAT ATGAACTTTT TACAGAAAAT TTTGGGCAGC AAGAAAGATT ACTGATAGAA GCTATCAACA ACAAACATAA GCTAGCTACA TTTTTAGCAG AACTTGAAAC AGGTAGGCTT GAATTTGAAA ATAGTTTGCA GAAGCTTGGC AAAGAAGCTA TGCAGCAAAT AGAAGTTAGT GGGTTAGAAA TATCAGACTT TTCTTATGGG CAACGTGGAA TAGCAGGCTA CTTAATGGGA GTATCTGAAA AAAAGGGCTT TACTCCTACC CAACGAGCTC TCACTGCTTT AGAATCTATA GAAGCTTGGT ACAGTAAAAC TAATTCTAAA AAGCTATCAA TTGTATCCTT AGTACAAAAT AGTTTACAAG ACATTCTAAA AGAAATTATA ACATACTACC AAGCAGGACA TCATATTTAT CATACAACAC TTGCTGTACA ACAGTTTATA TATGCATTTG GTATTATTAC ACACCTATTA GCAAGTTTAC GGAATCTGAG GGCTGAGAAG AATATAATGC TTATTTCAGA TGCAGCTAAC TTGCTACGAC AGGTTATTGC TGAGAACGAC ACACCTTTTA TATACGAAAA GGTAGGCTCT TTTTATAACC ATTTCTTAAT AGACGAATTT CAAGATATTT CAGATTTCCA ATGGCAAAAT CTAAAACCAT TAATTAGCAA TGGATTGGCT ACAGGACATA TGAGTTTATT GGTAGGCGAT GCAAAGCAAT CTATCTACCG ATGGCGTGGA AGTAAGTGGC AGCTTTTATC GAATAAATTA GAAAAAGAAT TTACAGCTAC TAAAAGTTTA GTACTAGAGC ACAACTGGCG TAGCAAACCC AGTATTGTAC ACTTTAACAA TACCTTTTTT ACGCAAGCGA GCAAAAATTT AGCAAGTCAT CTACAACAAG AAATTAATCA ATTAGAAGAC AACAGCACTT TAAAACAACA GTTAAATGAG CAGCTTCAAG AGATAGCAAA TGTTTATGCG CATGCTTACC AACATATACC AGCACCTGTG CAGTCAAGCC AAGATCAAGG TTATGTAGAA GCCAACTTTT TGTGTGAGGC AGATTTACAA GAAGAAAAAT CTAGCTGGAA AGAACAAATC AAGCAACGTT TACCAGCCTT GTTAGAAGAA TTACAAAAAG ACGGCTTTAG ATTGCAAGAT ATAGCTTTGC TAGTAAGAAG TCATGCAGAA GGACGTGAAA TATCTCAAAG TTTACTCAGT TACCAGCACT CAGAACATGC AAAACCAGGG TACAAGTACT CAGCGGTTTC TGCTGAATCT CTTTATTTAG GGAAAAGCCC TTGGATAAAT ATAATAATTA GTGCGCTTAA ATACTTAGAA GATGAGATAG ATATACTAGC TAAAACAGAG CTCGTTTATT TATACCAAAT ATATGTTTGT AAAAAAGAAC AAGGTATTTC TCATGAGCTT TTTCAGCAAA ACCGTGTGGA AAATGACCAT AATTTACTTC CTACCGAGTT TATATCAGAA TTTTATTGTT TAACTAAGCT TCCATTATAT GAACGCATAG TAAAATTAGT AAGTATTTTT CAGCTTAATA CAACAGCAAG CAAACCTTTT ATATATACTT TTCAAGATAT TGTATTAACA TATTTACAGC AAAATCCGGC AGAGCATTAT AATTTTTTAA AATGGTGGGA AGAGAAAGGA AACAAGCATG CACTACCTCA TATGGAAGGG GAAGAAGCCA TACCTATTAT GACCATTCAC CAAGCAAAAG GATTGCAGTT CAAAGTAGTT ATCGTTCCAT TTTGTGCATG GAATCTTGAC CATAATACTT ATAAGCCACC TATCATTTGG TGTTCAACAG ATAAAGCGCC TTTTTCAACT TTTCCCAGTT TGCCGCTTAG ATACCATAAG GGATTACAAG AAACAGTATA TGCACAAGCA TATTATGAAG AACGTATGCA AGTATATTTA GATCATTTCA ATTTGCTCTA TGTAACGCTT ACCAGGGCAG AAGAGCGATT GTATATATTT TCACAGCAAC CTGGGAAGAA TAAATTAGAT ACTACTGCTG ACTTACTATA CCGAACTATA AGTAGGCCAC CCTTAAAATT TAATGATAAC GAGGATAGCA ACAAATATTT CTTGAAATGG GAACATTATT GGCAAAATGA CAACCAAAAG TTAGTAATAG GAAACCCTAT AGCTAGCAAC CAGCAGCAGT AA
|
Protein sequence | MSKLYIYRSA AGSGKTYVLV KAYLQLALRA PLYFQRILAV TFTNRATQEM KQRILNSLHD IAQGKESLLT QELNQANGWD SKELQKRAQA VLSKVLHNYD HFSVGTIDSF LQSIVRNFSK ELGIQHGFTI EMDQETILNY IIDDVINTAN QDKQLHQWLV NFAENKLLAG KSWHFKQALK QLGYELFTEN FGQQERLLIE AINNKHKLAT FLAELETGRL EFENSLQKLG KEAMQQIEVS GLEISDFSYG QRGIAGYLMG VSEKKGFTPT QRALTALESI EAWYSKTNSK KLSIVSLVQN SLQDILKEII TYYQAGHHIY HTTLAVQQFI YAFGIITHLL ASLRNLRAEK NIMLISDAAN LLRQVIAEND TPFIYEKVGS FYNHFLIDEF QDISDFQWQN LKPLISNGLA TGHMSLLVGD AKQSIYRWRG SKWQLLSNKL EKEFTATKSL VLEHNWRSKP SIVHFNNTFF TQASKNLASH LQQEINQLED NSTLKQQLNE QLQEIANVYA HAYQHIPAPV QSSQDQGYVE ANFLCEADLQ EEKSSWKEQI KQRLPALLEE LQKDGFRLQD IALLVRSHAE GREISQSLLS YQHSEHAKPG YKYSAVSAES LYLGKSPWIN IIISALKYLE DEIDILAKTE LVYLYQIYVC KKEQGISHEL FQQNRVENDH NLLPTEFISE FYCLTKLPLY ERIVKLVSIF QLNTTASKPF IYTFQDIVLT YLQQNPAEHY NFLKWWEEKG NKHALPHMEG EEAIPIMTIH QAKGLQFKVV IVPFCAWNLD HNTYKPPIIW CSTDKAPFST FPSLPLRYHK GLQETVYAQA YYEERMQVYL DHFNLLYVTL TRAEERLYIF SQQPGKNKLD TTADLLYRTI SRPPLKFNDN EDSNKYFLKW EHYWQNDNQK LVIGNPIASN QQQ
|
| |