Gene Information |
Locus tag | Aasi_0852 |
Symbol | |
ID | 6376957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 1080998 |
End bp | 1083367 |
Gene Length | 2370 bp |
Protein Length | 789 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642681991 |
Product | hypothetical protein |
Protein accession | YP_001957952 |
Protein GI | 189502235 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
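The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1080998
end_bp = 1083367
gene_length_bp = 2370
protein_length_aa = 789


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks agree with the values listed in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)                 # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)    # True
```

Under those assumptions, 1083367 - 1080998 + 1 = 2370 bp, and 2370 / 3 - 1 = 789 aa, matching the Gene Length and Protein Length fields.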
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00159871 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGTTTAATA AAGATTATGC TAAAGTAATA GATATGAAAA TCAATTATAC ACTCGCACAG CAACTGATAG CACGTTTTTT ACTTATCAGC CTTTTTTTAC AAAGCTGCGG AGGAGGATTT AGTAGCAATC CAATTATTTC AGCCGGGGAA GAGCAAACAG TATCTATACA AACTAATACA CAAACAATTC TTTCTCAAGC AGATATCCAG CCTTTGGTCG ATCAAATGTT GATAGCCCAG GGAGGACATG CGGTTACTTT TTATGAAGAA GCCGGTGAGT TACGAGCTAA TGTCAAGATG AATGCACCTC GAGGATTTAG TAAAACATAT GAAGGATTAT CAGTAACGCT TGAGCAAGGA GCAGAAGTAA CGAGATTACC ACAGCTAAGT GAACAGGCGC AGCAACGCCG CATCCAGCTT CAATTAGCCA AAGACAACCA GGCACCTACA GTAGTGATCT ATAAAGGAGC AGGGCTCGCG GGAGGAATGC AGGAAGGAGA GGAGAAAGCC GATCAGTCTA TACCTAACGA ATGCCTTTGT CCCATTACTC ATGAGCTGAT GGAAGAGCCT GTTATTGCAC GGGACGGGCA TACGTATGAG CGAACGGCCA TTGAGCGATG GTTTTCAATG GGCAAACGCA CTAGCCCTGT CACAGGAGCT AAAGTTAGTA GCACTAAGTT AATACCTAAT TATGCTATAC GCAGCCTAAT TAAGAGTTTA AAGACACGCA ATTCTAGTAT AGCTAAGCCT ACACTCGGTA TGCAAAGTGC AAGACAGCAA TCATTACCAT CTGAAGCTTG CTTTTTTGTA AGTGATAGCG TTATTGATAA ACTATTTTCT CAAGCTAAAA GAGGGGATTT TGCAGCATTA AGTAAGCTTT TAAATTTAAG TGAAAATCAT CATGCCCATG CTCAATATAA AGTGGGAGTT ATGTATGAAA AAGGTCAAGG AGTAGCTAAA GATGCAAGGA ATGCAGTAGA ATGGTATCAA AAAGCTGCTA ATCAAGGACA TGCAAGAGCA CAATTTGAGC TAGGCATGAT GTATGATTAT GGAAAAGGAG TAGAAAAAGA TACAAGTAAA GCAATAGAAT GGTATGAGAA AGCTGCTAAT CAAGGACATG CAGATGCACA ATTAAAAGTA GGAGCGAAAT ATTTCAACGG TGAAGGCGTA GCACAAGATT ACATCAAAGC AGTAGAATGG TTCCAAAAAG CGGCTAATCA AGGGAATTTA GATGCACAAT ATAATCTAGG AGTTATGTAT GGAAATGGAA AAGGAGTAGA AAAAGATGCA AGAAAAGAAT TAGAATGGTA TGAGAGAGCT GCTAGAAAGG GAGATGCAAG TGCACAATAT AATCTAGGGC AGATATATGC AAATGGGCAA GGAGTAGCCA AGGATTATGT TAAAGCAATA GAATGGTATG AGAAAGCAGC CAACCAAGGA GATGCAAGTG CACAATTTAA TTTAGGGGTT ATGTATGGAA AAGGACGAGG AGTAGAAAAA GATGAGAAGA AAGCAGTAGA GTGGTATAAA AAAGCAGCCG ATCAAGGATA TGCTCCTGCG CAGTATAGCT TAGGATGTAT GTATGCCAAT GTTCAAAGAG TAGTTAAAAA TGATAAAAAA GCAATAGAAT GGTATAAAAA AGCAGCTAAT CAAAGACATG CAGAGGCACA ATCTAATTTA GGCATAATGT ATGCGAATGG ACGTGGAATA GCAAAAGATG AAAAGAAAGC AGTAAAATGG TATAAAAAAG CAGCTGATCA AGGAAATGCA AAAGCACAAT TTTACCTAGG AGTAAGATAT 
GAAAATGGAC GGGGAGTCGC TAAAGATGAA AAGAAAGCAG TAGAATGGTA TGAAAAAGCA GCTGAGCAAG GGCATACAGG GGCACAAAAT AACCTAGGGG ATATGTATGA AAATGGAAAG GGTGTAGCTA AAGATTATGT AAAAGCTGTA GAATGGTTTG AAAAAGTAGC GAATCAAGGG CATGCACTTG CACAATATAA TTTAGCTCGG ATGTATGACT ATGGACAAGG TGTGGTCCAA AACTACCAAG AGGCAGTAAA ATGGTATGAA AAAAGTGCTG GGCAAGGAAA CAACTATGCT AAGGCCTATC TAGGTCGCAT GTATTATCAC GGTTTTGGAG TTGAGAAAAA TCTTTTACAA GCAAGTAAAT TAATCCAGGA GGCTATTAGC CATATGAAAA GCAAGGCCGA GGAAGGGTGT ATAGAAGCTC AATATATAGT AGGATGGATG TATCAATATG GCCAAGGGGT CATGCAAGAC CACGTGGAGG CAGCAGTGTG GTATAAGAAA TCAGCTAACG CTTATCCAGC AGCACAAAAA GCATTAGATG AGTCGAACCA TGCTGGCTAA
Protein sequence | MFNKDYAKVI DMKINYTLAQ QLIARFLLIS LFLQSCGGGF SSNPIISAGE EQTVSIQTNT QTILSQADIQ PLVDQMLIAQ GGHAVTFYEE AGELRANVKM NAPRGFSKTY EGLSVTLEQG AEVTRLPQLS EQAQQRRIQL QLAKDNQAPT VVIYKGAGLA GGMQEGEEKA DQSIPNECLC PITHELMEEP VIARDGHTYE RTAIERWFSM GKRTSPVTGA KVSSTKLIPN YAIRSLIKSL KTRNSSIAKP TLGMQSARQQ SLPSEACFFV SDSVIDKLFS QAKRGDFAAL SKLLNLSENH HAHAQYKVGV MYEKGQGVAK DARNAVEWYQ KAANQGHARA QFELGMMYDY GKGVEKDTSK AIEWYEKAAN QGHADAQLKV GAKYFNGEGV AQDYIKAVEW FQKAANQGNL DAQYNLGVMY GNGKGVEKDA RKELEWYERA ARKGDASAQY NLGQIYANGQ GVAKDYVKAI EWYEKAANQG DASAQFNLGV MYGKGRGVEK DEKKAVEWYK KAADQGYAPA QYSLGCMYAN VQRVVKNDKK AIEWYKKAAN QRHAEAQSNL GIMYANGRGI AKDEKKAVKW YKKAADQGNA KAQFYLGVRY ENGRGVAKDE KKAVEWYEKA AEQGHTGAQN NLGDMYENGK GVAKDYVKAV EWFEKVANQG HALAQYNLAR MYDYGQGVVQ NYQEAVKWYE KSAGQGNNYA KAYLGRMYYH GFGVEKNLLQ ASKLIQEAIS HMKSKAEEGC IEAQYIVGWM YQYGQGVMQD HVEAAVWYKK SANAYPAAQK ALDESNHAG
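The sequences above are displayed in space-separated groups of 10 characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how that could be done, together with a GC-fraction helper. The names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last two groups of the gene sequence are used here for illustration, so the printed fraction describes that 20-base fragment and not the whole gene.

```python
def clean_sequence(raw):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last two 10-base groups of the gene sequence in this record.
first_groups = "GTGTTTAATA AAGATTATGC"
last_groups = "AGTCGAACCA TGCTGGCTAA"

head = clean_sequence(first_groups)
tail = clean_sequence(last_groups)

print(head[:3])           # first codon: GTG
print(tail[-3:])          # last codon: TAA
print(gc_fraction(head))  # GC fraction of the first 20 bases only
```

The first codon, GTG, is an alternative start codon under translation table 11, which is consistent with the protein sequence beginning with M, and the final TAA is a stop codon. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.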