Gene Dret_0119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0119 
Symbol 
ID8417923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp158436 
End bp159857 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content61% 
IMG OID645036684 
ProductAnthranilate synthase 
Protein accessionYP_003196999 
Protein GI258404257 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCAGG TGGTATTACA GCAACAAGGC CGGACTCTTC CGGCGGACAC ACAGACACCG 
ATCAGTCTGT ACCAGCAGAT GGTGGGCAAA AAACCAGGGC TGCTCCTGGA GAGTGCTGAA
GTCGACGGTC GTTGGGGGCG TTACAGCCTT GTGGCCTGGG ATTTTCGCCT TGTGGCCTCC
TGCCATGAGG GAAATCTCGA GCTGACGGTC AAGGATGCCC GGCTTGAGGC GCTCCGGAGC
TATACGGGGT TGCCTTTCGA ACAAGGATTG CGCGCGCTCC TGGCCGATCT GGCGGTACAG
CCTCCAGAGG AGCTAGCCGA TCTGCCCATT TTTTCCCGCG GTGTCTACGG ATATCTCGGC
TACGGCCTGG CCGGGTGTTT TGAGCCCGCC TTGGCCGACC AGCTCCCTCC TGAAAAGGCT
GAAGCCTGTC TGGTTCTGCC GGCCCATGTC CTTTTATTCG ACCATCTGCA CCACCGGTGC
GTTCAACTCA GCCTGGACGA CACCTTCCCC AAACACGGCG GCGGGCGGCA AGTTGGCGCC
AGCCTCGATG CCAAGCCGCG GCTGGGACAG GTGGAGACCA GACCGGACAA GGAACAATTC
TGTCAGAGCG TGCGTCGCAT CCGGGAGGAC ATCCACAATG GAGAGGCTAT CCAGGTCGTC
CTTTCGACCC GGTTTCAGGC CTCCTTCTCC GGAGAAGCCT TTGCTGTCTA CCGGCGCCTG
CGTCAGTACA ACCCTTCTCC CTATATGTAT TTTTTGCGCT TGCCGGGCAC GACTATCGTC
GGTTCGTCAC CGGAGGTCCT GGTGCGGTGT TCGGAGGGAC GGGTCGAGGA ATGCCCCATT
GCCGGGACCA GGCACCGCGG AACCACCCGG GAGGAAGATG CAGCCCTGGC CGACGAGTTG
GCGGCCGATC CCAAGGAGCG GGCCGAGCAC GTCATGCTTG TGGATTTGGG CCGCAATGAT
CTGGGCCGGA TCGCTGCAGC GGGCAGTGTC CGTGTCGATA GGCTCATGCA GGTCGAACGG
TTTTCCCATG TCATGCACCT GACCTCGTAT CTCGAGGCCG AGCTCAAGAC TGGCTTGGAC
GCCGTGGATG TCCTTGCGGC CACGTTTCCT GCCGGCACTG TTTCCGGAGC CCCGAAGATT
CGGGCTATGG AGACCATCGC AGAACATGAA AGCCAGCCCC GGGGGCCCTA CGCGGGGGCG
GTGGGCTGGA TCGGGCTTGA TCCGGATCAG GTCGCCCTGG ACACCGGGAT CTGTATCCGG
ACTTTGTGGA TCCAGTCCGG GACCATCTTC TGGCAGGCCG GGGCCGGCAT CGTGGCCGAC
TCGGATCCGG AAAAGGAATG GCAGGAATGC CAGAACAAGG CCCGCATTTT GCGGGAAGTC
CTTCAGGAAG AAGGGGAAAG TGATGTTTTT GCTCATCGAT AA
 
Protein sequence
MPQVVLQQQG RTLPADTQTP ISLYQQMVGK KPGLLLESAE VDGRWGRYSL VAWDFRLVAS 
CHEGNLELTV KDARLEALRS YTGLPFEQGL RALLADLAVQ PPEELADLPI FSRGVYGYLG
YGLAGCFEPA LADQLPPEKA EACLVLPAHV LLFDHLHHRC VQLSLDDTFP KHGGGRQVGA
SLDAKPRLGQ VETRPDKEQF CQSVRRIRED IHNGEAIQVV LSTRFQASFS GEAFAVYRRL
RQYNPSPYMY FLRLPGTTIV GSSPEVLVRC SEGRVEECPI AGTRHRGTTR EEDAALADEL
AADPKERAEH VMLVDLGRND LGRIAAAGSV RVDRLMQVER FSHVMHLTSY LEAELKTGLD
AVDVLAATFP AGTVSGAPKI RAMETIAEHE SQPRGPYAGA VGWIGLDPDQ VALDTGICIR
TLWIQSGTIF WQAGAGIVAD SDPEKEWQEC QNKARILREV LQEEGESDVF AHR