Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_2171 |
Symbol | |
ID | 8420022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 2468271 |
End bp | 2469350 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645038765 |
Product | chorismate synthase |
Protein accession | YP_003199033 |
Protein GI | 258406291 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGCA ACACCTTCGG GCATCTGTTC AGTCTGACCA CTTTCGGAGA ATCCCACGGC CCCGCCCTCG GCGGTGTGGT CCACGGCTGT CCCGCTGGCC TGCACCTGGA CGAAGCGGCG GTCCAACAGG AACTCGACAG ACGCCGCCCG GGACAGGGCA AAACAAGCAC CCCCCGGCGG GAGTCAGATA AAGTCCAACT CTTGTCTGGC ATCTTTGAAG GGGTAACCAC CGGCACCCCG ATCGGCTTCA GTATTGCCAA TGAAAACCAG CGCACCTCGG ATTACGAGGC CATGCGCGGC ATCTATCGCC CTGGGCACGC TGATTTCACC TACATGGCCA AATACGGGCA CCGCGACCAC CGCGGCGGAG GGCGCTCCTC GGGCCGGGAA ACCGTGAGCC GTGTTGTTGG TGGCGCCATC GCCCAGGTCT TCCTCGCCCA GCACAACATC CAGGCCCAGG CCTACACCCA GGAATTCGGT GGCATCCAGG CCGAAACGAT CGCTCCGGAC AAAGCCCATG AATTGCCCTA CTTCGCTCCG GATCCTTCGG TGGTCGAACT GTGGGATAAA CGGGTAAGTG AAATAAAAAA GGCGGGAGAT ACCTTGGGCG GGATTGTGGA GATTCAAATC CACGGTGTTC CGCCCGGGTT GGGCGAACCG GTTTTCGACA AGCTTGACGC CAGGCTCGCG GCTGCCTGCA TGTCCGTGGG AGCGGTGAAA AGCGTTGAAA TCGGTTGCGG CCGTCAGGCG GCCCGTCTCA CGGGCAGCGA AAACAATGAA CCGCCCGACC CGGCCCTGAC ACACCGCAAC AACGCCGGCG GCATCCTTGG GGGCATCTCC AATGGAGCTC CGATCGTGCT CCGGGCCGCG GTCAAGCCCA TCCCCTCCAT TGCCCAGGAG CAGGAGGTGG CCACCGCTGA AAAGACACTG GCTCCGTTCA CCATCGGCGG GCGACACGAC ATCAGCGCCA TCCCACGCAT CGTACCGGTG TTAAAAGCCA TGGCCCTGCT CACCATAGCG GACATGCTCC TGTTGCAGCG AAGCGCCCGG AGCGAAGGAT CACCTCCGGC CACCGTATAA
|
Protein sequence | MSGNTFGHLF SLTTFGESHG PALGGVVHGC PAGLHLDEAA VQQELDRRRP GQGKTSTPRR ESDKVQLLSG IFEGVTTGTP IGFSIANENQ RTSDYEAMRG IYRPGHADFT YMAKYGHRDH RGGGRSSGRE TVSRVVGGAI AQVFLAQHNI QAQAYTQEFG GIQAETIAPD KAHELPYFAP DPSVVELWDK RVSEIKKAGD TLGGIVEIQI HGVPPGLGEP VFDKLDARLA AACMSVGAVK SVEIGCGRQA ARLTGSENNE PPDPALTHRN NAGGILGGIS NGAPIVLRAA VKPIPSIAQE QEVATAEKTL APFTIGGRHD ISAIPRIVPV LKAMALLTIA DMLLLQRSAR SEGSPPATV
|
| |