Gene Information |
Locus tag | Aasi_1099 |
Symbol | |
ID | 6376495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | - |
Start bp | 1413076 |
End bp | 1414086 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 642682211 |
Product | hypothetical protein |
Protein accession | YP_001958171 |
Protein GI | 189502454 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
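The start and end coordinates, gene length, and protein length listed above are consistent with one another, which a little arithmetic confirms. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the convention the 1011 bp figure implies) and that the CDS ends in a single stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(1413076, 1414086)
print(length, protein_length(length))  # prints: 1011 336
```

The gene is unspliced ("Is gene spliced | No"), so the whole span between the two coordinates is coding and no intron lengths need to be subtracted.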
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.299802 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAAG CAGTAGTTGT CATACTTAAT CATAATGGCA AAGCTCTTTT ACAAAAGTTT CTACCCAGTG TAATACAACA TAGCCATCCT TATAGGGTAG TAATAGTAGA TAATGCATCG GTAGATGATT CTATTAATTT TTTATCTACT AATTTTCCTC ATATACAGTG TATACGCCAT ACTAAGAACG AAGGTTTTGC TGGTGGGTAT AATTTGGCTT TACAAGAAAT TAAAGCTAAA TACTATATAT TAATGAATGC AGATGTTAAA GTTACTAGTA ATTGGATAGA ACCTGTTTTA GAGTTAATGG AAGGAAATGA GCAGGTATCT GCTTGCCAAC CTAAAATATT ATCATACCAT AAGCATGAGA AATTTGAATA TGCGGGTGCA ACAGGGGGAT TTATAGATTT GTTAGGTTAT CCTTTTTGTC GGGGGCGTTT ATTTACTAGT ATAGAGAAAG ATCTAGGTCA GTATAATGAT ACGCGTGCAG TGTTTTGGGC TAGTGGCGCT TGCATGTTTC TACGAGCTAG CGTCTTTGGG GAGCTAGGTG GGTTTGATAA ACTTTTATTT GCCTACTATG AAGAAATTGA TCTTTGCTGG CGTATGCAAC AGTATGGGTA TAAGATTTAT TATTGTGGCA ATAGTAAGGT ATTCCATGTT GGAAGTGCAA CTATTGGTAT AGATAACCCA TATAAAACTT ATCTGAAATT TAGAAATCGA GCGCTTGTTC TTTATAAAAA CACACCAAGC CATTTTTTAA GCTGGAAACA CATTTTGCGT ATCATATTAG ATTTGTTAGC AGCTTTGCAA GCTGTTTTGC AAGGGCGAGC TAAACACAGT TGGGCTATTT TACAAGCACA GATCGATTTC TTTAAACTAA AAAAGAATTA TAAACCAACT TTAAATACAC AGCAGGTCAA GCAAGTGTAC CATGGCATCC TTCCTTTTGT TTACTTTATA CAAGGAAAAA AAAAGTTTTC TGATTTAAAC CAAGCTAAGT TTAGCAAATA G
|
Protein sequence | MEKAVVVILN HNGKALLQKF LPSVIQHSHP YRVVIVDNAS VDDSINFLST NFPHIQCIRH TKNEGFAGGY NLALQEIKAK YYILMNADVK VTSNWIEPVL ELMEGNEQVS ACQPKILSYH KHEKFEYAGA TGGFIDLLGY PFCRGRLFTS IEKDLGQYND TRAVFWASGA CMFLRASVFG ELGGFDKLLF AYYEEIDLCW RMQQYGYKIY YCGNSKVFHV GSATIGIDNP YKTYLKFRNR ALVLYKNTPS HFLSWKHILR IILDLLAALQ AVLQGRAKHS WAILQAQIDF FKLKKNYKPT LNTQQVKQVY HGILPFVYFI QGKKKFSDLN QAKFSK
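Both sequences are printed in space-separated blocks of 10 characters, so the spaces must be stripped before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the gene sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted), and that the gene sequence is already given in coding orientation, as its leading ATG and trailing TAG suggest even though the gene lies on the minus strand. All function names are hypothetical.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence above, with the original block spacing.
print(translate("ATGGAAAAAG CAGTAGTTGT C"))  # prints: MEKAVVV
```

Passing the full "Gene sequence" field to `translate` and `gc_fraction` allows the result to be compared against the "Protein sequence" and "GC content" fields of the record.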