Gene Strop_1584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_1584 
Symbol 
ID5058042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp1802043 
End bp1803161 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content69% 
IMG OID640473857 
Producthypothetical protein 
Protein accessionYP_001158428 
Protein GI145594131 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.654532 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.384648 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTGATG CTGAGGGGTT CGTCGTCGCC GCCGTCGGGG ATCTGCTCAT CGTGCGGGAC 
CGCCCGCACG ACATCTTTCG ACACGCGCGG GAGCAGCTGG CCGGGGCCGA CATCACCTTC
GGCCAACTGG AGACCGCGTA CGCCGACCAG GGGTCCCGGG GCTCCTCCGG GCCGCGCGGC
GCCGTCCCCC ACGACGTGGC GAACTATGCC GCCATCCCGC ACGCGGGCTT CGACGTCATC
TCGATGGCGA GCAACCACAC CGGTGACTGG GGTGCTGACG CGTTGCTCGA CTGCATCGGG
CGGCTCCGGC GCGATGGCAT CACCGTGGTG GGTGCCGGAG CCGACATCGA CGAGGCCCGG
CGGCCGGGGA TCATCGAGCG CGATGGCACC CGGGTCGGGT TCCTGGCCTA CTGTTCGGTC
GCGCCGGAGG GCTACTACGC CGGGCGGGAC AAGCACGGGG TGGCGCCGAT GCGGGCGATG
ACGCACTACG AACCGTTCGA GTCCGACCAG CCCGGCGGTC CGCCCCTGAT CTCCACTTTC
ACCAACGACG CCGATTTGGC GGCGCTCACC GCGGACATCT CCCGGCTGCG GGACCAGGTG
GACGTGCTGC TCGTGTCACT CCACTGGGGC CTGCACTTCC AGCGCGCGAG GCTCGCCGAC
TATCAGCCGG TGGTGGCCCA TGCCGCGATC GACGCCGGTG CGGACGCGGT GCTCGGGCAC
CATCCGCACA TCCTCAAACC GGTCGAGGTC TACCAGGGCA AGGTGATCTT CTACAGCCTC
GGCAACTTCG CCCTCGACCT CAACGATTCC TGGTGGCGGT CATTCAGTCG GGAATGGCTC
GAAGAAGCCA AGGCGTTCCA CGAGGCGCTC TCCCCCGAAC GGGATCTGAA GGCGGAGGGA
CGGAACTCGG CGATCGTCCG GCTGCACATC GCCGACGGCG GCGTCAGCCG GGTCGAGATC
CTGCCCGTGG TGATCAATGA GGAGAACGAG CCGGTGCCGT ACCGGGCGGA CACGCCCGAG
GGGCGTGCGG TCCGCGACTA CCTGGCGGAG ATCACGGCGG AGGCGGGGAT GAACACCGCC
TTCGACGTCG TTGACGACAG GGTTCTGGTT CGCATCTGA
 
Protein sequence
MGDAEGFVVA AVGDLLIVRD RPHDIFRHAR EQLAGADITF GQLETAYADQ GSRGSSGPRG 
AVPHDVANYA AIPHAGFDVI SMASNHTGDW GADALLDCIG RLRRDGITVV GAGADIDEAR
RPGIIERDGT RVGFLAYCSV APEGYYAGRD KHGVAPMRAM THYEPFESDQ PGGPPLISTF
TNDADLAALT ADISRLRDQV DVLLVSLHWG LHFQRARLAD YQPVVAHAAI DAGADAVLGH
HPHILKPVEV YQGKVIFYSL GNFALDLNDS WWRSFSREWL EEAKAFHEAL SPERDLKAEG
RNSAIVRLHI ADGGVSRVEI LPVVINEENE PVPYRADTPE GRAVRDYLAE ITAEAGMNTA
FDVVDDRVLV RI