Gene Dret_2171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2171 
Symbol 
ID8420022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2468271 
End bp2469350 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content62% 
IMG OID645038765 
Productchorismate synthase 
Protein accessionYP_003199033 
Protein GI258406291 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGCA ACACCTTCGG GCATCTGTTC AGTCTGACCA CTTTCGGAGA ATCCCACGGC 
CCCGCCCTCG GCGGTGTGGT CCACGGCTGT CCCGCTGGCC TGCACCTGGA CGAAGCGGCG
GTCCAACAGG AACTCGACAG ACGCCGCCCG GGACAGGGCA AAACAAGCAC CCCCCGGCGG
GAGTCAGATA AAGTCCAACT CTTGTCTGGC ATCTTTGAAG GGGTAACCAC CGGCACCCCG
ATCGGCTTCA GTATTGCCAA TGAAAACCAG CGCACCTCGG ATTACGAGGC CATGCGCGGC
ATCTATCGCC CTGGGCACGC TGATTTCACC TACATGGCCA AATACGGGCA CCGCGACCAC
CGCGGCGGAG GGCGCTCCTC GGGCCGGGAA ACCGTGAGCC GTGTTGTTGG TGGCGCCATC
GCCCAGGTCT TCCTCGCCCA GCACAACATC CAGGCCCAGG CCTACACCCA GGAATTCGGT
GGCATCCAGG CCGAAACGAT CGCTCCGGAC AAAGCCCATG AATTGCCCTA CTTCGCTCCG
GATCCTTCGG TGGTCGAACT GTGGGATAAA CGGGTAAGTG AAATAAAAAA GGCGGGAGAT
ACCTTGGGCG GGATTGTGGA GATTCAAATC CACGGTGTTC CGCCCGGGTT GGGCGAACCG
GTTTTCGACA AGCTTGACGC CAGGCTCGCG GCTGCCTGCA TGTCCGTGGG AGCGGTGAAA
AGCGTTGAAA TCGGTTGCGG CCGTCAGGCG GCCCGTCTCA CGGGCAGCGA AAACAATGAA
CCGCCCGACC CGGCCCTGAC ACACCGCAAC AACGCCGGCG GCATCCTTGG GGGCATCTCC
AATGGAGCTC CGATCGTGCT CCGGGCCGCG GTCAAGCCCA TCCCCTCCAT TGCCCAGGAG
CAGGAGGTGG CCACCGCTGA AAAGACACTG GCTCCGTTCA CCATCGGCGG GCGACACGAC
ATCAGCGCCA TCCCACGCAT CGTACCGGTG TTAAAAGCCA TGGCCCTGCT CACCATAGCG
GACATGCTCC TGTTGCAGCG AAGCGCCCGG AGCGAAGGAT CACCTCCGGC CACCGTATAA
 
Protein sequence
MSGNTFGHLF SLTTFGESHG PALGGVVHGC PAGLHLDEAA VQQELDRRRP GQGKTSTPRR 
ESDKVQLLSG IFEGVTTGTP IGFSIANENQ RTSDYEAMRG IYRPGHADFT YMAKYGHRDH
RGGGRSSGRE TVSRVVGGAI AQVFLAQHNI QAQAYTQEFG GIQAETIAPD KAHELPYFAP
DPSVVELWDK RVSEIKKAGD TLGGIVEIQI HGVPPGLGEP VFDKLDARLA AACMSVGAVK
SVEIGCGRQA ARLTGSENNE PPDPALTHRN NAGGILGGIS NGAPIVLRAA VKPIPSIAQE
QEVATAEKTL APFTIGGRHD ISAIPRIVPV LKAMALLTIA DMLLLQRSAR SEGSPPATV