Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_0621 |
Symbol | |
ID | 8418433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 745346 |
End bp | 747073 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 645037189 |
Product | transposase IS4 family protein |
Protein accession | YP_003197496 |
Protein GI | 258404754 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5421] Transposase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00154214 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00000449276 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCACACC TGCATAAAAA AGTTAAACAG GGCAAGCCTT ACTACTATAT CCGGGAGATG GCCCGGGTGA ACGGGAAGCC CAAAGTCGTT AACCAGATCT ATCTGGGGTC TGTTGAGCGC ATCATGGAAA TGGCCATGGG CCAGGAAAAA GCTGACCTGA GCAGAATCCA AGTCCAGGAG TTCGGTTCTC TTTTTCTGGC TAACCTCATG GAAAAACAAA TTGGGATCGT GGAGATTATC GATTCAGTCA TCCCGCCAGA TCCCAGGGAT TCAGGTCCGA GTTTGGGAGA ATTTTTTCTG TATGCGGCCT TTAACCGTAT GATCGACCCT TGCTCCAAGC ATAGCCTTTC GGATTGGTTG AAAGATTTCG CTGTCCATCA GATTCGACCC GTGGATACCA GTGCCTTGAC TTCTCAGCGA TATTGGAAGC GTTGGGAGCG TGTTGATCAG GAGTCTATTG AAGATATTTC CAAAGCTTTA TTCAAAAAAG TTAGTGAATT AGATCCCATT AAATCGGATT GTTTTCTTTT CGATACCACC AACTACTTCA ACTACATGGA CAGCAAGACT TCGTCCCAGC TAGCCCAGCG GGGGCGGAAC AAAGACGGCA AAAATTGGCT CCGACAGGTC GGCTTGGCGT TGCTTGTCTC AAGGGGCTCG CAGTTGCCTC TTTTTTATAA AGAATACGAG GGCAACTGCC ACGACTCCAA GCTGTTTAAC CGGCTACTGG GAAACATTTT CGCCTCCTTG GAGGACCTGG GCCGAGGAAC CGATTCGTTG ACCGTGGTCG TCGACAAGGG GATGAATTCC GAGGCGAATA TGCAGGCTAT CGACAAACAA GCGAAAGTGG ACTTTGTGAC CACCTATTCT CCTGCCTTCG CTGAGGATTT AGCCCAAACC GACCTGGAAA ATTTTGCTCC TGTTGATACC CGGAAAAACC GCGAACTCGC TGCAAGGGGT CGCCAAGACG ATCAGATGGT GGCTTGGCGA ACCACTGGGT TCTTCTGGGG CGCCACCCGA ACAGTGGTGG TTACCTACAA CCCCAGGACA GCGGCAAAAC AGCGGTATCG CTTCGATCAA AAACTCGGCA AGCTCCAAAA TGGTCTTTTC GAGCTACGCG CTCGAGTCCG GGCAGGCAAT AAGGCCTTGA AAAGCAAAGA GCAGGTTAAA GCCAGGTACA AAGACCTTTG CGAATCGCTA TATCTTCCCA AGAATCTTTA CAAAATCGAA TTTGTGACCA CTGGCAAACA GCTGAAAATG TATTTTCGCA AGGATCACTA TCAAATCAGC AAACATGTCA AACGCTTCGG GAAAAATATC ATCATTACGT CACATGATGA CTGGGATAAA GAGGCCATTG TACAGGCCAG CCTGGATCGT TACCAAGTCG AAAATGCCTT TCGCCAATCC AAGGGCAATG AATTCGGAAA TTTTCGTCCC GTATGGCATT GGACGGATGG CAAGATCCGT TGCCACTTGT TCGCCTGCCT TATCGCACAG ACCTATTTGC AGTTGATCGC CTTGCACCTG AAAAAAGCTG GACTGAATTA TTCCGTAGAT CAAGCAATGA AATCAATGCG CAATCTTTCC AGCTGCTTAT GCTGGAGCAA GGGCAAGCGG AAGCCAACTC GAATCATTGA AGAACCCAGT GAGGAGCAAG CGGCGATATT GAAGTCTTTT GGCTACAAAA TTCGCAATGG GGTCTTAGAG CGCCGGTCAC ACCGGTAA
|
Protein sequence | MAHLHKKVKQ GKPYYYIREM ARVNGKPKVV NQIYLGSVER IMEMAMGQEK ADLSRIQVQE FGSLFLANLM EKQIGIVEII DSVIPPDPRD SGPSLGEFFL YAAFNRMIDP CSKHSLSDWL KDFAVHQIRP VDTSALTSQR YWKRWERVDQ ESIEDISKAL FKKVSELDPI KSDCFLFDTT NYFNYMDSKT SSQLAQRGRN KDGKNWLRQV GLALLVSRGS QLPLFYKEYE GNCHDSKLFN RLLGNIFASL EDLGRGTDSL TVVVDKGMNS EANMQAIDKQ AKVDFVTTYS PAFAEDLAQT DLENFAPVDT RKNRELAARG RQDDQMVAWR TTGFFWGATR TVVVTYNPRT AAKQRYRFDQ KLGKLQNGLF ELRARVRAGN KALKSKEQVK ARYKDLCESL YLPKNLYKIE FVTTGKQLKM YFRKDHYQIS KHVKRFGKNI IITSHDDWDK EAIVQASLDR YQVENAFRQS KGNEFGNFRP VWHWTDGKIR CHLFACLIAQ TYLQLIALHL KKAGLNYSVD QAMKSMRNLS SCLCWSKGKR KPTRIIEEPS EEQAAILKSF GYKIRNGVLE RRSHR
|
| |