Gene Strop_1036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1036 
Symbol 
ID5057482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1175301 
End bp1176692 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content67% 
IMG OID640473305 
Product3-deoxy-7-phosphoheptulonate synthase 
Protein accessionYP_001157888 
Protein GI145593591 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.479807 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.750222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTATGGAA CGCGCATGCG ACCCGGCGTG GAGCGGTCGA TCCGCCATTC GCCACCAGCG 
ACGAACGATG GCGACAGAGA ACTCTGCCTA GACCGGTGGC GCGAGCTACC CCGGCGGCAG
GTGCCACCCT GGCCCGATCC CGCCGAGGTG GCAGCAGTGT GCGCGACACT CGGCAAAATG
CCGCCGATCG TCACACCCTA CGAGGTCGAT GAACTGCGTC ACCGCCTGGC CGAGGTATGC
GAAGGACGCG CCTTCCTGTT ACAGGGCGGC GACTGCGCCG AGACGTTCAC CGGAAACACT
GAGAGCCACC TGCTGGGCAC CACGCGGACG CTGCTGCAGA TGGCGATGGC GATAACGTAC
GGCGGTTCCG TTCCGGTGGT GAAGGTGGCG CGCCTCGCCG GCCAGTACGG GAAGCCCCGA
TCCTCGGCGA CCGATTCCCT GGGGTTGCCC GCGTACCGTG GCGATATTAT CAATGCGCGG
CACCCGGCCG AGTCGGCGCG GGCCGCCGAT CCGCAGCGCA TGATCGATGC GTACGCGAAT
TCCGCGGTCG CGATGAACCT CATCCGCGCC TATCCGCCGG ACGATCTGAC CGACCTCGAA
GAGCTGTACG ACGATACCTA CGACCTCATC CGCGCTTCCC CGGCCGGAGC CCGGTACCAG
GTCATCTCCG GCGAGATCGA CCGGGCGCGC GGCTTCGTCC GCGCGTGGGG GCCGAGCGAG
CGCCACGCGT TGCGGGAATC AAAGGTGTAC TGCTCGCACG AGGCGCTGGT GCTCGAGTAC
GACCGGGCGC TGACCCGAAT CAACGACGGC CGGGCATACG CGTTGTCGGG TCACTTTCTG
TGGGTCGGCG AACGCACCCG CCAGCTTGAC CACGCGCACG TCGACTTTGT CGCGCGTATC
GCCAACCCAA TCGGCGTGAA GCTCGGCCCG GCCGCCAGCC CGCATGCTGC GATCGAGCTG
TGCGAGCGGC TCAACCCCGA GAACCTGCCC GGGCGGCTCA CCCTGATCAG CCGCATGGGC
AACCGCCAGG TACGTGACGT GTTTCCGGCG ATCGTCGACA AGGTCACGGC CGCGGGCGCC
AAGGTCGTTT GGCAGTGCGA TCCGATGCAC GGCAACACCG AGCAGTCCTC ACACGGCTTC
AAGACCCGGC GCCTCGACCG GGTCGTTGAC GAGTTGCTGG GCTACTTCGA CGTGCACCGC
AGTCTGGGCA CCCATCCGGG AGGCGTCCAC GTCGAGCTCA CCGGTGAGAA CGTCACCGAG
TGCCTCGACG GGATACGTGG CGTCGAGGAC CAACACCTAC CGGATCGCTA CGAGACTGCC
TGCGACCCGC GGCTGAACAT GCGGCAGAGC TTAGAACTCG CGTTGCTGGT CGCCGAGATT
CTACGCGGCT GA
 
Protein sequence
MYGTRMRPGV ERSIRHSPPA TNDGDRELCL DRWRELPRRQ VPPWPDPAEV AAVCATLGKM 
PPIVTPYEVD ELRHRLAEVC EGRAFLLQGG DCAETFTGNT ESHLLGTTRT LLQMAMAITY
GGSVPVVKVA RLAGQYGKPR SSATDSLGLP AYRGDIINAR HPAESARAAD PQRMIDAYAN
SAVAMNLIRA YPPDDLTDLE ELYDDTYDLI RASPAGARYQ VISGEIDRAR GFVRAWGPSE
RHALRESKVY CSHEALVLEY DRALTRINDG RAYALSGHFL WVGERTRQLD HAHVDFVARI
ANPIGVKLGP AASPHAAIEL CERLNPENLP GRLTLISRMG NRQVRDVFPA IVDKVTAAGA
KVVWQCDPMH GNTEQSSHGF KTRRLDRVVD ELLGYFDVHR SLGTHPGGVH VELTGENVTE
CLDGIRGVED QHLPDRYETA CDPRLNMRQS LELALLVAEI LRG