Gene Dret_1473 details


Gene Information

Locus tag: Dret_1473
Symbol:
ID: 8419302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1702497
End bp: 1704506
Gene Length: 2010 bp
Protein Length: 669 aa
Translation table: 11
GC content: 60%
IMG OID: 645038048
Product: TonB-dependent receptor
Protein accession: YP_003198338
Protein GI: 258405596
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4771] Outer membrane receptor for ferrienterochelin and colicins
TIGRFAM ID:
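
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the helper names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1702497
end_bp = 1704506


def gene_length(start, end):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2010
print(protein_length(length_bp))  # 669
```

Both results match the Gene Length and Protein Length fields reported for this gene.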


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00220648
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCACTCAC GATTGAGCAA GATCGCGCTC ATGGCCTTCG GGGTTTGCCT CTGGGCCGGC 
ATTGGGGTGG CCGGAGCCCA GGAGACTCCG GCGAACGACC AAGTCTTTGA ACTCGGGGAG
ATCGTTGTCT CCAGCCCGGC CTCCCAGGTC GAAGCCGCTG GCAGCGTCGA TGTCATCACC
GCCGCGGACA TCAAAGAGGA CAATGCCCGA ACCCTGGACG AAGCCCTGGA TCTCGTCCCT
GGCATCTATG TCCGCCGTGG CAAGAAGGGC GTCCCGCGGA TCGACATGCG CGGCATGCGC
ACCCGGCAGA TTGAGCTGCT TCTCAACGGC ATCCCCATCA ATTCCAGTTA CGACAATCAG
TTTGACCCCA GTTTCATTCC GGTCGAGAAC ATCGCCCGGA TCAAGGTCAC CCGCGGGGCC
GGTTCGGTCC TTTACGGCTC CGGGGGCAAC GCCGGGGTGA TCAATATCAT TACCAAACGC
GGCGCCGAGG GCGTCCACGG TTCGCTCAAC GGCGAAGCCG CCCAGGGCGA CGCCTACCTC
GGCCGCGGCA CCCTGTCCGC GCGCAATGAA AACGGCAACA TCTTCATCAG TGGCAGCACC
TACAACCAGG AATACTTTCC CCTGGCCGAC GGCGCGGACG TCGATTCCCG TCTCGAAGGC
GGTGACGAGC GGGAAAACAG CGACCGCGAA AGCAGCCACC TCTTTACCAG CATGGAACTC
ACGCCCACGG ACCAAACCAC CCTGGCCCTG AACCTGGAAA TGCAGCAGGG CGAATACGGC
CGACCCCATG AAGTCCTGGT CGATGAATTC ACGGACAAAA AATACGAAAA TGACGACCCC
ACACGACTCA CCTTCGAGCG GGTCACTGAC CGCTCCAGCA TCGGCGGCCA ACTCGCCCTG
AGCCATGACC TCTCCGGGCC GCTTTCCTTC AAGACCTGGG GGTACGCCAC CCAACTCGAG
CAGACCGTGG ATCGCTTCGA CGACAAGACC TACACGACCC AGGACAAAGG GCGGCACACG
GAAAGCACCA CCTCGCGCTA CGGGTTGGCC GGCCAGGTGG CCCTGGATCT CCACCGTCTC
GGCGTGGCCA CCCTCGCTGC CTCTGCAGAG CAGGGGAATT GGGACAGCCG GGAGGAGAAG
TATGAAAAAG GGGCCATTGA TGAACTCAAT GAAATAGACA AGAACAACAA CATCTATTCC
ACCTCCTTGC AGTACGAGGT CGCCCCGACC AATGCCCTCG GTCTTGTCGC CGGTGCGGGC
TGGCACCAAC AGGAAAAGCA CAACGGCAGT ACCGGAGCCG ACTTCAGTTA CCAACTCGGC
GGTCACTACG ATCTGACCGC AACCACTCGC CTCAAGGCCA ACCACGCCCG GAAAATCCGC
TTTCCCTCCC TGCGTCGCCT GTATGAGGGC GATGACGCCA ACCCGGACCT GGAAGCCGAA
GTCACCTGGC ACTACGAGGC TGGCGTGGAA CAGGATATCC CCGGGTGGCA GACCAAGCTC
GGCCTGACCC TCTTTCACAT CGACGCCGAA GATTTTATCG AAAAAATTGG GGATGAACCC
TATAAAAACC AGGACAAATA CCGCTTCCAG GGCATCGAGG CGACCCTGGC CAACGAATCC
GTGGACAATC TCCGCCTGGA GGCGGCCTAC ACCTACCTCC AATCGGAAAA CCGCTCCGAC
GAAAGAGATT CGGACAAACT CCAGAACCGC CCGGAAAACA AGGTTTCGCT GAAGGCGACC
TACACCTGCC CCTGGGACCT CAAAATCTCA GGCTCCTACC TCTATGTGGG CGAGCGGTAC
GACTTTTCCA AGGATGAAGA CAGTCCTGGC AAGACCATCA CCCTCGACCC TTACCAGGTC
GTCAACCTCA AGCTGTCCAA GCCCCTGCCG AAGACCGGGT GGGAATTCTA TGCCGGTGTG
GACAACCTCT TTGACGAGGC CTACGCCGAG AACTACGCTT TGCCCCGTCC CGGACGCACC
GTGTACGGCG GTGTGGAATA CAGCTTCTAG
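
The gene sequence above is printed in blocks of ten bases separated by spaces, so it has to be cleaned before any computation. As a minimal sketch, the hypothetical helpers below (`clean_sequence` and `gc_fraction` are names chosen here, not part of the record) strip the whitespace and compute the fraction of G and C bases, which is the quantity the record reports as GC content.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw):
    """Fraction of bases that are G or C in a block-formatted DNA sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, used only as a small illustration.
sample = "ATGCACTCAC GATTGAGCAA"
print(len(clean_sequence(sample)))  # 20
print(gc_fraction(sample))          # 0.45
```

Applying `gc_fraction` to the complete 2010 bp sequence gives the value that the GC content field rounds to a whole percentage.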
 
Protein sequence
MHSRLSKIAL MAFGVCLWAG IGVAGAQETP ANDQVFELGE IVVSSPASQV EAAGSVDVIT 
AADIKEDNAR TLDEALDLVP GIYVRRGKKG VPRIDMRGMR TRQIELLLNG IPINSSYDNQ
FDPSFIPVEN IARIKVTRGA GSVLYGSGGN AGVINIITKR GAEGVHGSLN GEAAQGDAYL
GRGTLSARNE NGNIFISGST YNQEYFPLAD GADVDSRLEG GDERENSDRE SSHLFTSMEL
TPTDQTTLAL NLEMQQGEYG RPHEVLVDEF TDKKYENDDP TRLTFERVTD RSSIGGQLAL
SHDLSGPLSF KTWGYATQLE QTVDRFDDKT YTTQDKGRHT ESTTSRYGLA GQVALDLHRL
GVATLAASAE QGNWDSREEK YEKGAIDELN EIDKNNNIYS TSLQYEVAPT NALGLVAGAG
WHQQEKHNGS TGADFSYQLG GHYDLTATTR LKANHARKIR FPSLRRLYEG DDANPDLEAE
VTWHYEAGVE QDIPGWQTKL GLTLFHIDAE DFIEKIGDEP YKNQDKYRFQ GIEATLANES
VDNLRLEAAY TYLQSENRSD ERDSDKLQNR PENKVSLKAT YTCPWDLKIS GSYLYVGERY
DFSKDEDSPG KTITLDPYQV VNLKLSKPLP KTGWEFYAGV DNLFDEAYAE NYALPRPGRT
VYGGVEYSF
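
The protein sequence is the translation of the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below builds the codon table from the compact amino-acid string used for that table, which assigns the same amino acids as the standard code; table 11 differs mainly in the start codons it permits, and since this gene begins with ATG no special start handling is included. The function name `translate` and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Codons enumerated in TCAG order; the amino-acid string is the one used for
# translation table 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna):
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGCACTCAC GATTGAGCAA GATCGCGCTC"))  # MHSRLSKIAL
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (AGC TTC TAG) translate to the terminal residues SF followed by a stop.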