Gene Dret_0621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0621 
Symbol 
ID8418433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp745346 
End bp747073 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content49% 
IMG OID645037189 
Producttransposase IS4 family protein 
Protein accessionYP_003197496 
Protein GI258404754 
COG category[L] Replication, recombination and repair 
COG ID[COG5421] Transposase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00154214 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000449276 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCACACC TGCATAAAAA AGTTAAACAG GGCAAGCCTT ACTACTATAT CCGGGAGATG 
GCCCGGGTGA ACGGGAAGCC CAAAGTCGTT AACCAGATCT ATCTGGGGTC TGTTGAGCGC
ATCATGGAAA TGGCCATGGG CCAGGAAAAA GCTGACCTGA GCAGAATCCA AGTCCAGGAG
TTCGGTTCTC TTTTTCTGGC TAACCTCATG GAAAAACAAA TTGGGATCGT GGAGATTATC
GATTCAGTCA TCCCGCCAGA TCCCAGGGAT TCAGGTCCGA GTTTGGGAGA ATTTTTTCTG
TATGCGGCCT TTAACCGTAT GATCGACCCT TGCTCCAAGC ATAGCCTTTC GGATTGGTTG
AAAGATTTCG CTGTCCATCA GATTCGACCC GTGGATACCA GTGCCTTGAC TTCTCAGCGA
TATTGGAAGC GTTGGGAGCG TGTTGATCAG GAGTCTATTG AAGATATTTC CAAAGCTTTA
TTCAAAAAAG TTAGTGAATT AGATCCCATT AAATCGGATT GTTTTCTTTT CGATACCACC
AACTACTTCA ACTACATGGA CAGCAAGACT TCGTCCCAGC TAGCCCAGCG GGGGCGGAAC
AAAGACGGCA AAAATTGGCT CCGACAGGTC GGCTTGGCGT TGCTTGTCTC AAGGGGCTCG
CAGTTGCCTC TTTTTTATAA AGAATACGAG GGCAACTGCC ACGACTCCAA GCTGTTTAAC
CGGCTACTGG GAAACATTTT CGCCTCCTTG GAGGACCTGG GCCGAGGAAC CGATTCGTTG
ACCGTGGTCG TCGACAAGGG GATGAATTCC GAGGCGAATA TGCAGGCTAT CGACAAACAA
GCGAAAGTGG ACTTTGTGAC CACCTATTCT CCTGCCTTCG CTGAGGATTT AGCCCAAACC
GACCTGGAAA ATTTTGCTCC TGTTGATACC CGGAAAAACC GCGAACTCGC TGCAAGGGGT
CGCCAAGACG ATCAGATGGT GGCTTGGCGA ACCACTGGGT TCTTCTGGGG CGCCACCCGA
ACAGTGGTGG TTACCTACAA CCCCAGGACA GCGGCAAAAC AGCGGTATCG CTTCGATCAA
AAACTCGGCA AGCTCCAAAA TGGTCTTTTC GAGCTACGCG CTCGAGTCCG GGCAGGCAAT
AAGGCCTTGA AAAGCAAAGA GCAGGTTAAA GCCAGGTACA AAGACCTTTG CGAATCGCTA
TATCTTCCCA AGAATCTTTA CAAAATCGAA TTTGTGACCA CTGGCAAACA GCTGAAAATG
TATTTTCGCA AGGATCACTA TCAAATCAGC AAACATGTCA AACGCTTCGG GAAAAATATC
ATCATTACGT CACATGATGA CTGGGATAAA GAGGCCATTG TACAGGCCAG CCTGGATCGT
TACCAAGTCG AAAATGCCTT TCGCCAATCC AAGGGCAATG AATTCGGAAA TTTTCGTCCC
GTATGGCATT GGACGGATGG CAAGATCCGT TGCCACTTGT TCGCCTGCCT TATCGCACAG
ACCTATTTGC AGTTGATCGC CTTGCACCTG AAAAAAGCTG GACTGAATTA TTCCGTAGAT
CAAGCAATGA AATCAATGCG CAATCTTTCC AGCTGCTTAT GCTGGAGCAA GGGCAAGCGG
AAGCCAACTC GAATCATTGA AGAACCCAGT GAGGAGCAAG CGGCGATATT GAAGTCTTTT
GGCTACAAAA TTCGCAATGG GGTCTTAGAG CGCCGGTCAC ACCGGTAA
 
Protein sequence
MAHLHKKVKQ GKPYYYIREM ARVNGKPKVV NQIYLGSVER IMEMAMGQEK ADLSRIQVQE 
FGSLFLANLM EKQIGIVEII DSVIPPDPRD SGPSLGEFFL YAAFNRMIDP CSKHSLSDWL
KDFAVHQIRP VDTSALTSQR YWKRWERVDQ ESIEDISKAL FKKVSELDPI KSDCFLFDTT
NYFNYMDSKT SSQLAQRGRN KDGKNWLRQV GLALLVSRGS QLPLFYKEYE GNCHDSKLFN
RLLGNIFASL EDLGRGTDSL TVVVDKGMNS EANMQAIDKQ AKVDFVTTYS PAFAEDLAQT
DLENFAPVDT RKNRELAARG RQDDQMVAWR TTGFFWGATR TVVVTYNPRT AAKQRYRFDQ
KLGKLQNGLF ELRARVRAGN KALKSKEQVK ARYKDLCESL YLPKNLYKIE FVTTGKQLKM
YFRKDHYQIS KHVKRFGKNI IITSHDDWDK EAIVQASLDR YQVENAFRQS KGNEFGNFRP
VWHWTDGKIR CHLFACLIAQ TYLQLIALHL KKAGLNYSVD QAMKSMRNLS SCLCWSKGKR
KPTRIIEEPS EEQAAILKSF GYKIRNGVLE RRSHR