Gene Dret_1883 details


Gene Information

Locus tag: Dret_1883
Symbol:
ID: 8419726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2162653
End bp: 2164083
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 49%
IMG OID: 645038469
Product: hypothetical protein
Protein accession: YP_003198745
Protein GI: 258406003
COG category:
COG ID:
TIGRFAM ID:
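
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both assumptions are consistent with the figures in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2162653, 2164083)
print(length)                  # 1431
print(protein_length(length))  # 476
```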


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0565061
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.00344442
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAACGCA TTGCATTGCT CGCCTTGGTC GCGGTTTTCG TACTGGGCAT GGCTGGCAGC 
GCCTCCGCCC TGCACCTGCA GCAGGGTGAA GCCGGCGAAG CCGGTGGTGG CACCGCCTAC
ATGAACATTT ACGGCCACGT GGCCAACTAC GCTTACTGGG AAAACAACAC TGACTTTAAC
GACAACGAAT ATGATTCTGC CGATTTCTTT GGAGAAACCG ACTGGTACCT CGGCTTTGAA
ATTGCTCAAA GCGAAAACCT GAAAGCCTTC ATCCAACTCT ACCAAGAACA AGACTGGGGT
GTTGGACCAG GTGAAGCCGG CGAAGATAGC ACCGAAAATT TGAAAATTGA ACACGCCTAC
TTGGATTTCA TGGTCCCGAA TACGGACATT CGTGTAAAAG GCGGTGATTT TGGTTTTCAA
CTCCCTGGAG CCGGTGGGGA CAACGTGTTT GATGACACCG CCAATGGCAT CCAGGTTTCC
ATGCCGTTTA ATGACATGGT TGGACTGACC GCCGGTTACG CCATGTTGGA TCAGCAAGGA
TCTGCCTGGA ACGCGACTGA TGCTGGCGAT GACAATGAAT TCACCGCGCA AACAAGTGCT
GTGTACGCAG TATTGCCCAT CACCATGGAT GGCGTTGCCT TCAATCCGTA TTTCATGTTC
GCTGAGTCTG ACATTGGCGA AGAATCTGGC AGTTTAGTGC CGTTCTATGA CGAAGACGGA
AGCCCTTTTG GGGAAATTGA TGATCACACC GCCTATTGGT ATGGTTTTAA CGGCCAAGTG
ACCATGTTTG ATCCTATCAC CATTGGTGCG GATTTCATTT ATGGAACATC CAGTGTTGAA
ACTAAAGATA ATTTCACTGT AGAAGGTGAA GATGCTGAAG ACGCCGCCGA ACGTGCTGGC
TGGTTCACTC AAATCACTGC TGACTACAAA ATGGACATGT TTACTCCTGG AGTAGTGTTC
TTCTACGGAA GCGGCGATGA TGATGATCCG AGCGATGGTA GTGAAATGTT GCCCACCGTT
GTGTCTAACT ATAAGCCGTT TAACGCTGTA GACGACTTTG GTAACGGCTA TGTTACTAAC
GCCGATGGAG ACACACAGGC TGAAGTTGGT GCCGGCCTTT GGGCTTTGGC TTTTCAACTG
AAAGACATCT CTTTTATCGA GAAACTTTCT CACGATCTGA CCATTGGTTA TGTTGCAGGC
ACCTCCGACA AAGAAGCTGC TGACTTCTAT GGTGACAGTT TCATGACCGA AGAAGACTCC
ATGTGGACCG TTGAATTCGA TAACTCCTAC CAGGTTTACG AAAATCTGTC CGCTGGCCTG
TTCTTTGCCT ACACCAAGGC CGACTACGAC GAAGATCTTG GTGACCGTGG CGATGACTTC
GCTGACGATG CCTACACCAC AATGGAAGCC AGCTTGAGCT ATAGCTTCTA G
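
The gene sequence above is laid out in blocks of 10 bases, so the whitespace has to be removed before any calculation. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_content` are hypothetical, and the record does not say how its own GC content figure was computed or rounded, so an exact match is not guaranteed.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage on a short synthetic example (not the full gene):
print(gc_content("ATGC ATGC"))  # 50.0
```

Passing the full gene sequence text to `gc_content` gives a value to compare against the GC content field in the Gene Information section.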
 
Protein sequence
MKRIALLALV AVFVLGMAGS ASALHLQQGE AGEAGGGTAY MNIYGHVANY AYWENNTDFN 
DNEYDSADFF GETDWYLGFE IAQSENLKAF IQLYQEQDWG VGPGEAGEDS TENLKIEHAY
LDFMVPNTDI RVKGGDFGFQ LPGAGGDNVF DDTANGIQVS MPFNDMVGLT AGYAMLDQQG
SAWNATDAGD DNEFTAQTSA VYAVLPITMD GVAFNPYFMF AESDIGEESG SLVPFYDEDG
SPFGEIDDHT AYWYGFNGQV TMFDPITIGA DFIYGTSSVE TKDNFTVEGE DAEDAAERAG
WFTQITADYK MDMFTPGVVF FYGSGDDDDP SDGSEMLPTV VSNYKPFNAV DDFGNGYVTN
ADGDTQAEVG AGLWALAFQL KDISFIEKLS HDLTIGYVAG TSDKEAADFY GDSFMTEEDS
MWTVEFDNSY QVYENLSAGL FFAYTKADYD EDLGDRGDDF ADDAYTTMEA SLSYSF
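
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the method used to build this record. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation, and it does not handle the alternative start codons that table 11 permits. That limitation does not matter for this gene, which begins with ATG. The function name `translate` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the gene sequence in this record:
print(translate("ATGAAACGCA TTGCATTGCT C"))  # MKRIALL
```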