Gene Strop_1231 details


Gene Information

Locus tag: Strop_1231
Symbol:
ID: 5057681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 1389105
End bp: 1391021
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 70%
IMG OID: 640473500
Product: dihydroxy-acid dehydratase
Protein accession: YP_001158079
Protein GI: 145593782
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
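
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions agree with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 1389105
END_BP = 1391021
GENE_LENGTH_BP = 1917
PROTEIN_LENGTH_AA = 638


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```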


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00143562
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.953695
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTGCATCT GTTGGGACAC GTCCTGCGCG GGCTCGGGTC GAACTCGCGA CGGTGTCAGA 
CTGGTATGCA TGCCTGAGCT GCGGTCGAGG ACCTCCACCC ACGGTCGGAC GATGGCGGGC
GCCCGAGCCC TGTGGCGGGC CACCGGGATG ACCGACGACG ACTTCGGCAA GCCGATCGTC
GCCATCGCCA ACAGCTTCAC CCAGTTCGTT CCGGGGCATG TCCACCTCAA GGAGCTCGGT
GGCCTGGTGG CCGAGGCGGT AGCCGATTCC GGCGGGGTGG GTCGGGAGTT CAACACCATC
GCCGTGGACG ACGGCATCGC GATGGGCCAC GGCGGCATGC TCTACTCGCT GCCAAGCCGG
GAACTGATCG CCGACGCCGT GGAGTACATG GTCAATGCCC ACTGCGCCGA CGCCCTGGTC
TGCATCTCCA ACTGCGACAA GATCACTCCC GGGATGCTGC TGGCCGCGCT GCGGCTGAAC
ATCCCAACTG TCTTCGTCTC CGGCGGCCCG ATGGAGGCCG GCAAGACCGT CGCGATCGAG
GGGGTCGTAC ACTCCAAGAT CGACCTGATC GATGCGATGA TCGCTGCGTC CAACGAGGCG
GTCACCGACG ACCAGCTCGG CCAGATCGAA CGCTCGGCCT GCCCCACCTG CGGCTCCTGC
TCCGGCATGT TCACCGCCAA CTCGATGAAC TGCCTCACCG AGGCGATCGG CCTGGCCCTT
CCCGGCAACG GGTCAACGCT GGCGACCCAC GCCGCCCGCC GGTCACTCTT CGTCGAGGCC
GGCCGCACCG TCGTCGAGAT CGCCAAGCGT TGGTACGACG GGGACGACGC CACGGTGCTG
CCCCGTGCGG TAGCCAACCG CGCCGCCTTC GACAACGCGG TCGCCCTCGA CGTCGCGATG
GGCGGTTCGA CGAACACCAT CCTGCACCTG CTGGCCGCCG CCCGCGAGGC CGAGCTGGAC
TTCGGGGTGG TGGACATCGA CGCCATCTCC CGGCGGGTGC CCTGCCTGGC GAAGGTCGCA
CCGAACTCTC CCCACTACCA CATGGAGGAC GTCCACCGGG CCGGTGGCAT CCCGGCCATC
CTCGGTGAGC TGGACCGCGC CGGCCTACTC AACCGGGAGG TCCACGCGGT GCACTCCCCC
TCGCTGGAGC GCTGGCTCGC CGACTGGGAC GTTCGGGGTG GCACCGCGAC ACCGACGGCG
GTCGAGCTGT TCCATGCCGC ACCGGGCGGG GTCCGCACCG TCGAGCCGTT CTCCACCACC
AACCGCTGGT CGACACTGGA CACAGATGCG GCCGACGGCT GCGTACGGGA GCGGGCCCAC
GCGTACACCG CGGACGGAGG GCTGGCCATC CTGCACGGCA ACCTGGCACC GGAGGGCTGC
GTGGTGAAGA CCGCCGGGGT ACCCGAGGAG TGCCTGACCT TCCGCGGCCC CGCCAAGGTC
TACGAGTCCC AGGACGACGC GGTCACCGCC ATCCTGGCCA AGGAGGTCGT CGCCGGCGAC
GTGGTGGTGA TCCGCTACGA GGGCCCCCGG GGTGGGCCCG GGATGCAGGA GATGCTCTAC
CCCACCTCGT TCCTCAAGGG CCGAGGGCTG GGGCGGGCCT GCGCGCTACT GACCGACGGC
CGCTTCTCCG GCGGCACCTC CGGACTGTCC GTCGGGCACG TCTCCCCGGA GGCCGCCGCC
GGTGGGCTGA TCGCCCTGGT CGAACCGGGC GACGAGATCG TCATCGACAT CCCGAACCGG
GCCATCGAAT TGGCCGTACC GGCCGAGGTG TTGGACGCCC GCCGGGTCGC ACAGGAGAAG
CGAGACCGCC CGTACACGCC GGCGGAGCGG CAGCGCCCCG TCTCCGCAGC GCTGCGCGCG
TACGCCGCCA TGACCACCTC GGCCAGCGAC GGCGCCTACC GCCGCGTCCC CGAGTGA
 
Protein sequence
MCICWDTSCA GSGRTRDGVR LVCMPELRSR TSTHGRTMAG ARALWRATGM TDDDFGKPIV 
AIANSFTQFV PGHVHLKELG GLVAEAVADS GGVGREFNTI AVDDGIAMGH GGMLYSLPSR
ELIADAVEYM VNAHCADALV CISNCDKITP GMLLAALRLN IPTVFVSGGP MEAGKTVAIE
GVVHSKIDLI DAMIAASNEA VTDDQLGQIE RSACPTCGSC SGMFTANSMN CLTEAIGLAL
PGNGSTLATH AARRSLFVEA GRTVVEIAKR WYDGDDATVL PRAVANRAAF DNAVALDVAM
GGSTNTILHL LAAAREAELD FGVVDIDAIS RRVPCLAKVA PNSPHYHMED VHRAGGIPAI
LGELDRAGLL NREVHAVHSP SLERWLADWD VRGGTATPTA VELFHAAPGG VRTVEPFSTT
NRWSTLDTDA ADGCVRERAH AYTADGGLAI LHGNLAPEGC VVKTAGVPEE CLTFRGPAKV
YESQDDAVTA ILAKEVVAGD VVVIRYEGPR GGPGMQEMLY PTSFLKGRGL GRACALLTDG
RFSGGTSGLS VGHVSPEAAA GGLIALVEPG DEIVIDIPNR AIELAVPAEV LDARRVAQEK
RDRPYTPAER QRPVSAALRA YAAMTTSASD GAYRRVPE
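
The protein sequence can be reproduced from the gene sequence with the bacterial genetic code (translation table 11). The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI base ordering, and it assumes that an alternative start codon such as GTG at the first position is read as methionine. `translate` and the constant names are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino acid assignments for
# translation table 11 are the same as for the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    A recognized start codon at position 0 is translated as 'M'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as given in the record.
FIRST_60_BP = "GTGTGCATCT GTTGGGACAC GTCCTGCGCG GGCTCGGGTC GAACTCGCGA CGGTGTCAGA"

if __name__ == "__main__":
    print(translate(FIRST_60_BP))  # MCICWDTSCAGSGRTRDGVR
```

The output matches the first 20 residues of the protein sequence in the record, including the leading M encoded by the GTG start codon.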