Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | WD1034 |
Symbol | |
ID | 2738855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Wolbachia endosymbiont of Drosophila melanogaster |
Kingdom | Bacteria |
Replicon accession | NC_002978 |
Strand | + |
Start bp | 994720 |
End bp | 995940 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637173189 |
Product | hypothetical protein |
Protein accession | NP_966758 |
Protein GI | 42520843 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGAAAA GTAATTTTTT TGTATGGTTG ATAGTATCTC TTTTCTATGC ATACCAGTAT GTGTTACGTG TCATGCCAAA CATAATTGCG CCTGTATTAA TAACAAAATT TAATATAAGC ATTGCAGATT TAGGGCAATT TAATGGTCTC TACTATGTGG GATTTACACT AGTTCATATC CCTGTTGGTC TTTGTTTTGA TAGATTCGGT CCAAAAATTG TTTTACCTAT TTGTGCTGTC TTGGTATCTA TTGGAACATT ACCTCTTGTG TGCTTTGATG AGTGGTATTA TTCAGTATTA GGTAGAATAA TTGTTGGGAT TGGCTCGTCT GCGTCAGTAA TAGGAGTATT TAAAGTTGCC AGTATGTATT TTCCGCGAGA AAAATCAGCA AAAATGGCAA GCATATCTGT TATTATAGGG CTATTAGGGG CAACATATGG TGGTCTACCA ATAGAACTTT TGCTTGATGA GTTTGGTTGG AATTACGTAA TTTATATTCT TTCAGGGTTT GGTTTCTTAC TCGCTTTATC TTTATTTTTA GTTATTCCTT ACAATGCTTA CGACTTTCGA AAAGAAAAAA TTAGCATGAA GGATCTAAAA ACTGTGCTTT TCAATAAATA TATCATTCTA ATTAGTTTTT TTGGTGGCTT AATGGTAGGT CCACTAGAAG GTTTTGCTGA TGCTTGGACA AAAGCTTTTT TATGTGAGGT ATATCAAATG GCCGGTGATC TAGCGTCTTC AATTTCTTCT GTTATATATG TAGGATTTGC AACTGGATTG CTCTCTTTTG CTCACATATT AGAAAAATAT CCAAATAGAC ATTATGAAGT TATTATTGTC TGTTCCTTTG CAATGATTGC TAGTTTCCTT TTACTTTTTA CACAGGCTGG AGGATTGTAT ATTGTATTGC CTGCACTTCT TGTTATAGGT TTCGCTTCTG GATATCAAAT AGTAACACTG TACAAAGCAT TAAGTTATGT AAATAATAAC TTAATAGGAT TAACTACAGC TGTGTCAAAT ATGATAGTCA TGGTTTTTGG CTATTTTTTT CACACTGGGA TCGCAAAAAT AATAGATTTG TGTTGGGATA GGACAGTGAT ACAAGGAAAT CCTGTGTATA GTGCTGAATT GCTGATAAAA GCAACATCTA TTATTCCTGT ATGTTTGCTG ATAGCTGTTT TTGGGCTCTT ATGGTTAAAA AAGCAGGATA AGCAGTCTTA A
|
Protein sequence | MSKSNFFVWL IVSLFYAYQY VLRVMPNIIA PVLITKFNIS IADLGQFNGL YYVGFTLVHI PVGLCFDRFG PKIVLPICAV LVSIGTLPLV CFDEWYYSVL GRIIVGIGSS ASVIGVFKVA SMYFPREKSA KMASISVIIG LLGATYGGLP IELLLDEFGW NYVIYILSGF GFLLALSLFL VIPYNAYDFR KEKISMKDLK TVLFNKYIIL ISFFGGLMVG PLEGFADAWT KAFLCEVYQM AGDLASSISS VIYVGFATGL LSFAHILEKY PNRHYEVIIV CSFAMIASFL LLFTQAGGLY IVLPALLVIG FASGYQIVTL YKALSYVNNN LIGLTTAVSN MIVMVFGYFF HTGIAKIIDL CWDRTVIQGN PVYSAELLIK ATSIIPVCLL IAVFGLLWLK KQDKQS
|
| |