Gene Information |
Locus tag | GWCH70_1998 |
Symbol | |
ID | 7978954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2054657 |
End bp | 2055928 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644798825 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002949995 |
Protein GI | 239827371 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
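
The coordinate and length fields in this record agree with each other, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the stop codon. The record states neither convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2054657, 2055928)
print(length)                     # 1272, matching Gene Length
print(protein_length_aa(length))  # 423, matching Protein Length
```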
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000565006 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATAC GCGACTGGGA TTGGAATTTA AAGGTGCGCT TATTTGGCGA GGCGCTCGTC CATATCGCAT TTTGGATGTT TTTTCCGTTT ATGGCGATAT ATTTTACTGA GTCATTTGGA AAGGATAAAG CAGGACTATT ATTGATTGTT TCGCAAATTT TTTCCGTGAT CGCTAGTTTG ATGGGCGGAT ATTGCGCGGA TGTATTCGGA CGCAAGCGGA TGATGGTGTT GTCTGCCTAT GGTCAAGGGG CAGCGTTTTT CTTTTTTGCG CTTGCTAGTT CTCCGTGGTT TACTTCGCCG TTCGTGGGGT ATCTTTGCTT TACGATCGCC GGAATTTGCG GGGCGTTTTA CTGGCCCGCA AGCCAGGCGA TGGTTGCCGA TGTGGTGCCG GAAGAACATC GCAGCAGCGT CTTTGCCGTG TTTTATATGT CCATCAATGT TGCTGTTGTG GTCGGACCGA TTATTGGCGG GGTGTTTTAT GAACATTATC TTTTTGAGTT ATTGCTTGCT ACGGCATTTT TGTTTTTGTT TCTCGCTGCG GTATTGACAA AATGGCTTCG TGAGACTGTG CCCGCGCGTG AAAACGGAGA GCTGCTGCAC GGGAAATGGT ATGAGTTTTT GTGGCAGCAA GTCCGTCAGT ACAGCGTGAT TGTGCGCGAC CGAATATTTC TATTGTTTAT CATTGCAGGT ATTCTTGTCG CACAGACGGT TATGCAGCTG GATTTACTGA TTCCTGTGTA TACAAAAGAT GCGGTGGAGA GGCAAACGTT ATTTTCCTTC GGTGATTGGT CGTTTACGCT TACTGGAGAG AAAGCATTTG GCCTTCTTAT TTCCGAAAAC GGCTTGCTTG TTGTGCTGTT TACCGTGTGG GTAACGAAAT GGATGGAACG TTATCATGAA CGGACGGCGT TTGTTGGGTC GTCCATTGTT TATGGAATTG CGATTTTTTT GTTTGGACAA ACGACATCGA TTTGGGGACT TATTTTGGTG ATGGGATTGT TTACGTTTGG GGAGCTGATG ACAGTAGGAA TCCAGCAAAC GTTTATTTCC AAGCTTGCTC CGGAGGAGAT GCGCGGGCAG TATTTTGCGG CTGCGAGCTT GCGCTGGACG ATCGGACGGG CGATTGCCCC GCTTTCGATC ACGGCAACGA TATGGCTCGG GTATGAATGG ACGTTTTTCT TGTTAAGTAT GCTTTCTTTC ATAAGTGCGG CCTTATATAT GATTATGTTT CAATTGCTTG AAAAACGGCA AACACACAAA GCATTAACAT AA
|
Protein sequence | MRIRDWDWNL KVRLFGEALV HIAFWMFFPF MAIYFTESFG KDKAGLLLIV SQIFSVIASL MGGYCADVFG RKRMMVLSAY GQGAAFFFFA LASSPWFTSP FVGYLCFTIA GICGAFYWPA SQAMVADVVP EEHRSSVFAV FYMSINVAVV VGPIIGGVFY EHYLFELLLA TAFLFLFLAA VLTKWLRETV PARENGELLH GKWYEFLWQQ VRQYSVIVRD RIFLLFIIAG ILVAQTVMQL DLLIPVYTKD AVERQTLFSF GDWSFTLTGE KAFGLLISEN GLLVVLFTVW VTKWMERYHE RTAFVGSSIV YGIAIFLFGQ TTSIWGLILV MGLFTFGELM TVGIQQTFIS KLAPEEMRGQ YFAAASLRWT IGRAIAPLSI TATIWLGYEW TFFLLSMLSF ISAALYMIMF QLLEKRQTHK ALT
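
Both sequences are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any downstream use. The following is a minimal sketch of that cleanup plus two simple checks (GC fraction and a basic CDS shape test). The helper names are hypothetical, the start codon set is an assumption covering only commonly used bacterial starts rather than every start allowed by translation table 11, and the example input is a made-up toy string, not this gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: a non-exhaustive set of commonly used bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def ungroup(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a (possibly grouped) nucleotide sequence."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and it has a start and stop codon."""
    bases = ungroup(seq)
    return (
        len(bases) >= 6
        and len(bases) % 3 == 0
        and bases[:3] in START_CODONS
        and bases[-3:] in STOP_CODONS
    )


toy = "ATGGCGCTAT AA"  # hypothetical 12-base example in the same grouped layout
print(ungroup(toy))         # ATGGCGCTATAA
print(looks_like_cds(toy))  # True
```

The Gene sequence field begins with ATG and ends with TAA, so although the Strand field is "-", the sequence appears to be shown in coding orientation.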
|
| |