Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3859 |
Symbol | |
ID | 5541363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5042689 |
End bp | 5043765 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640895969 |
Product | transposase IS4 family protein |
Protein accession | YP_001433914 |
Protein GI | 156743785 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.631601 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACG ACAAACAACT CCTTGACCTG TACAGCGACT ATTTGATAAG CGCCTTCGGA CAAACCACGG CGACAGGGCT GTCATCCCTA TTGGATGGTG AAATCAGCCA TGACCGAGTG CAACGCTTGC TGGCAGGGAA AGAACAAACC TCGGCGGATT TGTGGCGGCT TGTCAAGCCA CATGTGCGCC AGATTGAAAG CGAAGACGGC GTGGTAATTG TGGATGACAG TATTGCGGAA AAACCATACA CCGACGAGAA CGACATCGTT TGCTGGCACT ACGATCATTC CCAGCAAAGA ACCGTCAAGG GCATCAACTT TGTGACCTGC CTATATCACA GCCAAGGCGT ATCGTTGCCC GTTGGGTTCG AACTGGTTCG GAAAACAGAA CGCTACACCG ACCCGAAGAC GGGAAAGGAA AAACGTCGCT CGGATAAAAC CAAGAATGAA ATGTACCGAG ACCTCCTACA ACAAGCCGTC AAAAACCAGA TTCCGTTCAA ATATGCGCTC AACGATATCT GGTTTGCTTC CGCCGAAAAC ATGAACTTTG TCAAACTCAC ACTCAAAAAG GAGTTCGTTA TGCCACTCAC AGGCAATCGC AAAGTGGCGC TGAGTGTGAA TGCCAAGCAG CCGGGACGTT ATCAGCGAGT GGACACGCTG GAACTGGAAC CGATGAAACC TGTCACGGTG TATTTGGAAG GCGTGGGGTT TGCGCTCCTT CTCATCAAAC AAGTCTTCAC AAACGAAGAT GGCTCGACAG GCATCCAATA TCTGGTCGCC AGCGATACCA CACGAGACGG TAACGGGATT GCCGCAATCT ATCACAAACG ATGGAACGTG GAACCCTATC ACAAGTCGCT CAAACAGAAT GCTTCGCTGG AGAAGTCACC CACCCAAACG GTGACAACTC AGACCAATCA TTTCTTTGCG GCTCTGTGCG GTTACATCAA ACTCGAACTG CTCAAAGGCG CCACCAAACT CAATCATTGT GCGCTCAAAT CCAAACTGTA CTTGCACGCT ATTCATGCCG CTTATGCCAA GTTGCAGGAA CTCAATCCTG TTCAACTGGC TGCGTAA
|
Protein sequence | MKNDKQLLDL YSDYLISAFG QTTATGLSSL LDGEISHDRV QRLLAGKEQT SADLWRLVKP HVRQIESEDG VVIVDDSIAE KPYTDENDIV CWHYDHSQQR TVKGINFVTC LYHSQGVSLP VGFELVRKTE RYTDPKTGKE KRRSDKTKNE MYRDLLQQAV KNQIPFKYAL NDIWFASAEN MNFVKLTLKK EFVMPLTGNR KVALSVNAKQ PGRYQRVDTL ELEPMKPVTV YLEGVGFALL LIKQVFTNED GSTGIQYLVA SDTTRDGNGI AAIYHKRWNV EPYHKSLKQN ASLEKSPTQT VTTQTNHFFA ALCGYIKLEL LKGATKLNHC ALKSKLYLHA IHAAYAKLQE LNPVQLAA
|
| |