Gene Information |
Locus tag | GWCH70_2681 |
Symbol | |
ID | 7976503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2712881 |
End bp | 2714062 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644799481 |
Product | transposase IS4 family protein |
Protein accession | YP_002950640 |
Protein GI | 239828016 |
COG category | |
COG ID | |
TIGRFAM ID | |
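The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions match the listed values but are not stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2712881
end_bp = 2714062

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as listed) and ends in one stop codon,
# which is not counted in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1182 393
```

Both results agree with the listed Gene Length (1182 bp) and Protein Length (393 aa).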
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000123658 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGAT TAGCACATCA TCAAGGAATT CACAAGTTTT TCACGATGTT GGGGTTGACC CTTTATTTTT CAAAACCTGT TATGAAGCAT CTCGTTCATA TCGTGGATGC GATGATCACG AAGGGCTTTT CGGGAACATT GACCGATCTT CATCATGGGA GCTTTCATCC GAACCACCGC ACGACACTCA GCCATTTTTT CACGAAAAGT CCGTGGGAGG AAGAGACACT GCTTCGCAAA CTCCAGCAGT GGATCCTTCG TCGTGTCGAA CGCATCGCCA AACAGGAGAA TCAACCCTTG TTTGTTTCGA TCGATGATAC GATTTGCCAA AAAACCAAGC CTTCGTCACG GGCAACACAC GCGATTCAGG GATGTGATTG GCACTACTCT CACTCAGAGA AAAAATCGAT TTGGGGCCAT TCTCTTGTTT GGTTCATGGT TCATACTGCA ACCCAGGCGT TTCCCTTTGC CTTCCGCCTC TACGACAAGA CGGCGGGAAA AAGCAAAGGG GAACTCGCGA TCGAGATGCT TTCTTCGTTG GATGTACGCC GTCCCGTTTA TGTGCTGATG GATTCGTGGT ATCCATCGAA AGCGCTCGTG GAAGCTTGTC TGAAAAAAGG ATTCCACGTC ATCGCGATGC TCAAGACGAA CCGGATTCTC TATCCGAACG GCGTTGCCGT CCAAGCGAAG CAGTTGGCCC GCTCCATCGA ACCGAATGAC ACTCACCTCG TCACGGTGGG AGAAGAGCAT TATCGCGTCT ATCGTTACGA AGGAGCGCTC AACGGTCTCG ACCATGCGGT GGTGCTGCTC GCTTGGAAAG CCGATCAGCC GATGACATCG GAACATCTTC ACTGCGTCTT GAGCACCGAC CGGGAGCTAA GCGATGAAGA GATCTTGCGC TACTATGCCC AGCGTTGGTC GATCGAATGC TTTTTTCGAC AAGCGAAAGA CCAGCTGAAG CTCGATGGGT ACCGCGTTCG TCAACGTTGG GCGGTGAAAC GGTATTGGAT CTTGGTGCAA CTCGCTTATG TGTACAGTAT GGTCGAATCC AACAGCGATT TCTCTACCGG GCTTGACTTC CTTCGAAAGA AGAAAGGACA TAGCCTCGTG GAGTTTATTT ACGATGCAGC GAAACAAGAT ATTCCCATTG ATGTCGTTAA AAAACAGCTT CATGTGGCAT AA
|
Protein sequence | MNRLAHHQGI HKFFTMLGLT LYFSKPVMKH LVHIVDAMIT KGFSGTLTDL HHGSFHPNHR TTLSHFFTKS PWEEETLLRK LQQWILRRVE RIAKQENQPL FVSIDDTICQ KTKPSSRATH AIQGCDWHYS HSEKKSIWGH SLVWFMVHTA TQAFPFAFRL YDKTAGKSKG ELAIEMLSSL DVRRPVYVLM DSWYPSKALV EACLKKGFHV IAMLKTNRIL YPNGVAVQAK QLARSIEPND THLVTVGEEH YRVYRYEGAL NGLDHAVVLL AWKADQPMTS EHLHCVLSTD RELSDEEILR YYAQRWSIEC FFRQAKDQLK LDGYRVRQRW AVKRYWILVQ LAYVYSMVES NSDFSTGLDF LRKKKGHSLV EFIYDAAKQD IPIDVVKKQL HVA
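The relationship between the gene sequence and the protein sequence above can be reproduced with a small translation routine. This is a minimal sketch, not the pipeline used to build the record: it assumes the gene sequence is already given in coding orientation (it starts with ATG even though the gene is on the minus strand), and it uses the standard codon assignments, which translation table 11 shares. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene starts with ATG. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last two blocks of the gene sequence from the record.
print(translate("ATGAATAGAT TAGCACATCA TCAAGGAATT"))  # MNRLAHHQGI
print(translate("CATGTGGCAT AA"))                      # HVA
```

The two printed fragments match the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein and the GC content field to be checked the same way.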