Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aasi_1729 |
Symbol | |
ID | 6377088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Amoebophilus asiaticus 5a2 |
Kingdom | Bacteria |
Replicon accession | NC_010830 |
Strand | + |
Start bp | 1086793 |
End bp | 1088007 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003573138 |
Protein GI | 294661262 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000763037 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGAAAATCT CCATTATACT ATATAAGAGT AAAACGCTGG CTGATGGCTC TCATCCGCTG ATGGTCCGTA TCAGCCAATT GAAAGCTAAG AAATATTTCT CTACTAGCCT CTCTTGCCAG GCTAGTTTAT GGGATTTTGA TAAGCATACC CCTAAAAGAA GTCATCCAGA CCGTAAATTA CTCGAGGCGA TTCTTGCGCA AAAGAAAGCT AGCTACCACA CAAAACTTTT AGAACTTGAA AGTGAACAGA AAACACTTTC GCTCCAGCAG CTCGTTCAAG CAATCGAAGA GCCTAGACAA ACTTCACAGG AACTATTCCC TTTTTTATCA GAAGTGTGCG AGAATCTAGT TCAAAGTGGT AAAATAGGGA GGGCCCGATT ATATAAGCGA CTGTTAACTT CTTTAAAAGA GTTTACAGGG ACTAAGCAGC TCTCGTTTAG TGATATTGAT ATGCCTTTTT TAAACAGTTA TGAGGCTTTT CTCTATAAAC AGGGGTTGGT TGAGAATTCA ATTGGCACTT ATTTCCAAGT CTTAAGGGCT CTTCTCAATA AAGCCATCCA AGCGAAACGT ATGAAAAAGG AGCATTATCC TTTTGATAAC TTCTCCTTGG GAAAGTTTAG CACACTCACC CAGAAGCGTG CAATGAGCCG GGAAGATCTA CAGCAGATTA TAGCATTACC TTTAGCGGCT GATAGTAAGC TCCAAGTGGC TAGGGATTAC TTTTTGTTTA GCTATTACGG ACAAGGCATG AATTTTAGGG ATATGGCAAC CCTTAAATGG AAACAAATCA TTAAAGATTG TGTGGTATAT ACGCGCCTAA AGACAGGTAA GATGATGCAG TTCAAACTGA TGTCGCCCGC GCTAGAGATT CTGGACAGAT ATAAGGAGCA TCCAAGTGGT CATTTAGATG ATTTTGTTTT TCCAATCTTA AGTAAAAATG AGCATGTCAC GCCTCAGCAA ATTTCCAATC GAATTAATCG CGTACTTCGG CAAGTGAATG CTTCACTCAA AGAATTAGCC GAGGCGGCCA AGGTACCAGT TCACCTGACG ACCTATGTAG CACGGCATAC TTATGCAACC GTGCTCAAGC AAAGTGGCAT CTCAACGGGG GTGATTAGTG AAGCGCTCGG ACACAAGGGT GAGAAAATTA CGCAGACTTA TCTGAAGAGT TTTAGCAACG AAGTAATTGA CGAGGCGAAT AGTTTCTTAC TATAA
|
Protein sequence | MKISIILYKS KTLADGSHPL MVRISQLKAK KYFSTSLSCQ ASLWDFDKHT PKRSHPDRKL LEAILAQKKA SYHTKLLELE SEQKTLSLQQ LVQAIEEPRQ TSQELFPFLS EVCENLVQSG KIGRARLYKR LLTSLKEFTG TKQLSFSDID MPFLNSYEAF LYKQGLVENS IGTYFQVLRA LLNKAIQAKR MKKEHYPFDN FSLGKFSTLT QKRAMSREDL QQIIALPLAA DSKLQVARDY FLFSYYGQGM NFRDMATLKW KQIIKDCVVY TRLKTGKMMQ FKLMSPALEI LDRYKEHPSG HLDDFVFPIL SKNEHVTPQQ ISNRINRVLR QVNASLKELA EAAKVPVHLT TYVARHTYAT VLKQSGISTG VISEALGHKG EKITQTYLKS FSNEVIDEAN SFLL
|
| |