Gene Dret_1775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1775 
Symbol 
ID8419616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2044655 
End bp2045869 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content60% 
IMG OID645038359 
ProductHipA N-terminal domain protein 
Protein accessionYP_003198637 
Protein GI258405895 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID[TIGR03071] HipA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.474256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTGG CCGTCAATGT CTATTTGGAC GGGTTTTTTG TTGGCAATGC TTGGCGGGAC 
GGCGACGGAC AACTCCTGTT CAAGTATTAT CCAGGATGGC TGGAATCGTC TGGGAGACAA
CCGCTGTCGC TGAGGTTGCC GTTGCGCCCG GATGTCTACG GCGATGCAGT AGTCCGCCCG
TTTTTGAGTG GTATCTTGCC GGAGCAGGGG GCGCGCCATC GTCTGGCGCA ATGTCTGGAC
ATTGACGACA GCGATATCTT GGGCCTTCTC GCGGCTATCG GGGGGGATTG CCCGGGGCGC
ATTTCCTTTT CGCTGCCCCA GCACGGCCCC GAGCACGTTC CACCGGGACA ACGGCCCCTG
GACGATCGGA TTCTGACGGC GCTTCCCGAG GTGCTGGCCG AGTGTCCGTT TTTGGCCGGG
GAACAATCGT TGCGTTTGTG CCTTCCCGGT AGCGGGAGGG CCTTGCCGGT GGTCCAGGAG
AGAAACCGGT TTTCTTTGCC GCTGGGCGAA CGTTCCAGCA CACATGTCCT GAAAACGGCC
CCTTTTGATC GCCAAGCGGG TGTGGCTAAT GAAGCCTTGT GCCTCGCGCT TGCCGCCAAG
GCTGGAGTGC CGGTCCAAGC GAGTATGCAG GTGGCCACCT CCCGCGAACC GCTTCTGCTG
GTGCAACGAG CTGACAGATC AGCCGTTGGT GCAGAAGGGA TTCAAAGATT GGGGGCCGAG
AGCCTCGGAC AAGCCCTGGG GATCGCGAAC GGGATACCGA TCCAGGACAG CAGGTTTTTT
TTGCAGACGG GATTCCAGCT TTTGAAACAG GTCGGCGTGG CGCCTATCCG GGATCAAAAA
CGGCTTCTGC AATGGAGCGC ATTGCAATGG GTGTTCGGGA ACGAGATGTT GCCTGTCGAA
AATATTACAT TGCTCAGGCA GGGGCAGGGA TGGGGTGTGG CCCCGTTTTA CGGTTTGGTC
TGCAAGGCGG AGCATTGGGA GCCCTCTGCG GCTGCAGCGA GCGATGAGGG GGCAGAATGG
CTGGAACATC GGCCCTGCGT GGCCTGGGCC GAGGCCACGG CTGTGCCGGA AAAAACAGTG
GGCGCGATTT TTGATCAGGT AGCGCGTGGG ATACTCAAAT ATGTCGATGA GGCGGTCGGG
CAGGTCGGTG ATCCCGAGCG GACACGGACT TTGGCCGCGT ATTTGCGCGC CAGGGCGGAA
AAAGGCTCCC GCTGA
 
Protein sequence
MTVAVNVYLD GFFVGNAWRD GDGQLLFKYY PGWLESSGRQ PLSLRLPLRP DVYGDAVVRP 
FLSGILPEQG ARHRLAQCLD IDDSDILGLL AAIGGDCPGR ISFSLPQHGP EHVPPGQRPL
DDRILTALPE VLAECPFLAG EQSLRLCLPG SGRALPVVQE RNRFSLPLGE RSSTHVLKTA
PFDRQAGVAN EALCLALAAK AGVPVQASMQ VATSREPLLL VQRADRSAVG AEGIQRLGAE
SLGQALGIAN GIPIQDSRFF LQTGFQLLKQ VGVAPIRDQK RLLQWSALQW VFGNEMLPVE
NITLLRQGQG WGVAPFYGLV CKAEHWEPSA AAASDEGAEW LEHRPCVAWA EATAVPEKTV
GAIFDQVARG ILKYVDEAVG QVGDPERTRT LAAYLRARAE KGSR