Gene Information |
Locus tag | GWCH70_2027 |
Symbol | |
ID | 7978980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2084715 |
End bp | 2086283 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644798849 |
Product | plasmid pRiA4b ORF-3 family protein |
Protein accession | YP_002950019 |
Protein GI | 239827395 |
COG category | |
COG ID | |
TIGRFAM ID | |
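The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon (neither is stated by the record itself). The function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
gene_length = span_length(2084715, 2086283)
protein_length = protein_length_from_cds(gene_length)
print(gene_length, protein_length)  # 1569 522
```

Under these assumptions the computed values agree with the Gene Length (1569 bp) and Protein Length (522 aa) fields.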
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000415443 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTGATTTATC AACTAAAGGT GCAGTTAAAA GGTATTCGCC CGCCGGTATG GCGGCGGTTA TTGGTGCCAG GCGATATGAC GTTTGCCGAG CTTCATCGCG TGCTGCAAAA AGCGTTTGAC TGGGAAGACC GGCATTTGCA TACGTTTTAC ATCACCAAGA CGCGCGGAAC GGCGAAGCTG CGCATCGAAA TCGGCAATGA TGTCGGTGAT AGATGGAGCA ATGCGGACTA TGAAGAGCAT AAGGAGCGGC TTTTCGATTG GCTTGTGCAG GAGGGAGACC GCTGCTTGTA TATTTATGAC TTCGGCGACG ATTGGGAACA TGAGATCGTG CTTGAAAAAA TAGTGAAGCC GCAGCCAGAT CTCATCTATC CAGTTTGCCT CAAGGCGGTG CGTGTTGCGC CTGAAGAAGA CAGCATGGGC GAAGGGTGGA ATCCAGAGGC GATCGAGACA AAGGAATTGA CAGCAATGGT CAATGCAAAA CTGGCTCCAC TGAGCAAAAA AGTTGGCAAG GAAATCCAAA AGAAAGCGCG AAAAGAGATG GAGAAGGGAG CGCAGGCCAC ACAAGGCAAC GTATGGCGGG CACTGTTGGA AAAAGCGGTG GCGTTTAACC GGTTAGCGCC GTGGCAATGG ATGGATGATG ATGAAATTTT CCTTGTCATC GATCCGGAAA CGAACGAGCG CTTATATTGT TCCGTCATTG GCGCGCTTGG CCAAGAACAT GGCATGGTCG TATATATCGG GGAGCAAGGA TACAAAAGCC TGCAGCATTT ATTCAAACAG CCATATCCCG AACAAGATCC TGTTTATACA CAACGGGCGG TGCTCATTTC CTTTGCCGAC CGAAATGAGT TAAGCAAGGA AGACTATGAG CTCCTCCGCT CCCAAGGCAT GACGTTCCGC GGCAAAAAGC AGTGGCCGCA GTTTCGCAGC TTTGTCCCAG GGTACTATCC GTGGACGATT TCCGAAGAAG AGGCAAAATT GGTGACGGCG GCGCTTGATC AGGCGTTTGA TGTCGCGCGG CGCGCTGGGG AAGGGGAGCT TTCGCTTCCA GTGTTTCCGC AGGATGAAAA GATGTTTGCC CGCATCGGTG AAAAGAAGGA TGGGAATGTC GTTTGGCGCG ATGACCACGT TCCGCTTGCC GAGCTGGAAG CCGAGGAAAA AGCGCCGATG TATGAACTGC TTGTCGATCC GAAATTGATT AAAATGGTGA AAAATATTGG ACAAGTATAC CATGGCAGCA TCGAATTTGA CGCGGGGTAT ATCAACCGGC CGGTTCAAGA GAAGCGCGGG GAGCGCCCGT ATTTTCCGAC ATTTGTACTG GCGGTGGATG TGAACACTGG GTTTATCATC CATAACGATT TGCTTCCGAT CGAGAATGTG GCGATGCGCG TGCAAAAAAG CTTTTTGGAC ATGCTTCTGC GGCTCGGGAA AATACCGCGG GAAATCCGCA TGAAAAAAGA AACGAAGCAA ATGCTCGCCC CGGTGCTGCG CAAACTGCCG ATCCGGACGA TGGAAGTGCC GCGGACTCCT GCGTCCGAAC ATGTCCGCAG AACTTTTGAA ATGTTTTAG
Protein sequence | MIYQLKVQLK GIRPPVWRRL LVPGDMTFAE LHRVLQKAFD WEDRHLHTFY ITKTRGTAKL RIEIGNDVGD RWSNADYEEH KERLFDWLVQ EGDRCLYIYD FGDDWEHEIV LEKIVKPQPD LIYPVCLKAV RVAPEEDSMG EGWNPEAIET KELTAMVNAK LAPLSKKVGK EIQKKARKEM EKGAQATQGN VWRALLEKAV AFNRLAPWQW MDDDEIFLVI DPETNERLYC SVIGALGQEH GMVVYIGEQG YKSLQHLFKQ PYPEQDPVYT QRAVLISFAD RNELSKEDYE LLRSQGMTFR GKKQWPQFRS FVPGYYPWTI SEEEAKLVTA ALDQAFDVAR RAGEGELSLP VFPQDEKMFA RIGEKKDGNV VWRDDHVPLA ELEAEEKAPM YELLVDPKLI KMVKNIGQVY HGSIEFDAGY INRPVQEKRG ERPYFPTFVL AVDVNTGFII HNDLLPIENV AMRVQKSFLD MLLRLGKIPR EIRMKKETKQ MLAPVLRKLP IRTMEVPRTP ASEHVRRTFE MF
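The gene sequence begins with TTG while the protein sequence begins with M, which is consistent with translation table 11, where TTG can act as an initiation codon and is then read as Met. The sketch below illustrates this. It builds the codon table from the standard NCBI base order (TCAG) and translates a sequence, reading a recognised start codon at position 1 as M. The function name `translate_cds` and the restriction to three start codons (ATG, GTG, TTG) are assumptions made for illustration; table 11 lists further, rarer initiation codons that are not handled here.

```python
from itertools import product

# Codon assignments in NCBI order: bases T, C, A, G with the third position
# varying fastest. Table 11 uses the same amino-acid assignments as the
# standard code; it differs in which codons may serve as starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the three most common bacterial start codons are handled.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; spaces between 10-base groups are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in COMMON_STARTS:
            protein.append("M")  # an initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence field.
print(translate_cds("TTGATTTATC AACTAAAGGT GCAGTTAAAA"))  # MIYQLKVQLK
```

The printed residues match the first ten residues of the protein sequence field. Passing the full gene sequence field to the same function should allow the complete translation to be compared against the record.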