Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_0378 |
Symbol | |
ID | 6377305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 450125 |
End bp | 451198 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642681547 |
Product | hypothetical protein |
Protein accession | YP_001957529 |
Protein GI | 189501812 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.978276 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAAA CAAGCCAAAA CTTCCTTTTT AAATACCTTA ATAATAGCTC TCCTACAGGT TACGAAGCTA GTGGCCAAAA GATTTGGCTA GAATATTTGT ACCCTTATGT AGATGATATA GTTACCGATG TGTACGGCAA TGCAATAGGG GTTATTAATC CACAAGCAGA ATATAAAGTA GTAATAGAAG CACATGCAGA TGAAATTTCT TGGTATGTTA ACTATATCAG TAAAGAAGGA TATATATATG TAATCCGTAA TGGAGGCTCA GACTATGAAA TAACACCTTC TATGCGTGCC AAAATACATG TAGGGAATAA AACTATACCA GCCGTATTTG GCTGGCCAGC CATACATGTA CGTGAAGACA AAAAATCTGG AAAAACTCCG GACATAGAAA CAGCTATCTT GGACTGTGGT TGCCATTCAG ATAAGGAAGT ACAAGAGTTA GGTATCCATG TAGGTTCTGT GGTTACCTTT GATGCTGACT TAATAACACT AAACGAAAAA TATTTAGTAG GACGAGGTCT AGATAACCGC ATAGGTGGTT TTATGATTGC ACAAGTAGCT AGAAAGCTAT ACGAAAATAA AATCAGGTTA CCTTTTGGCC TATATATTAC AAATGCAGTA CAAGAAGAAG TAGGACTACG AGGAGCTGCT ATGGTAGCCA ATCGAATAAA GCCTGATTTG GCTATTGTTA CAGATGTAAC ACATGATACT CAAACACCCC ACTATAACAA GGTTAAACAA GGAGACATAG CTTGTGGTAA AGGGCCTGTA TTAACTCATG CACCTGCTGT ACAACATAAA GCTTTAAAAC TATTGGTTGA TGCAGCTAAA AACCATAATA TACCTATACA ATGGCAAGCC AGCTCACGCA GCACAGGCAC TGATACAGAT GCGTTTGCAT ATTGTGGCGC AGGTATCCCT TCGGCACTTA TCTCGCTACC ACTTAAGTAT ATGCACACAA CAGTAGAAAT GGTGCATAAA AATGATATAG AGCAGATTAT AGAATTATTC TACCAATTGT TAACAAATTT ACAACCTAAT CATAGTTTTA GCTACTTCGA TTAA
|
Protein sequence | MEQTSQNFLF KYLNNSSPTG YEASGQKIWL EYLYPYVDDI VTDVYGNAIG VINPQAEYKV VIEAHADEIS WYVNYISKEG YIYVIRNGGS DYEITPSMRA KIHVGNKTIP AVFGWPAIHV REDKKSGKTP DIETAILDCG CHSDKEVQEL GIHVGSVVTF DADLITLNEK YLVGRGLDNR IGGFMIAQVA RKLYENKIRL PFGLYITNAV QEEVGLRGAA MVANRIKPDL AIVTDVTHDT QTPHYNKVKQ GDIACGKGPV LTHAPAVQHK ALKLLVDAAK NHNIPIQWQA SSRSTGTDTD AFAYCGAGIP SALISLPLKY MHTTVEMVHK NDIEQIIELF YQLLTNLQPN HSFSYFD
|
| |